Meta’s unmodified Llama 4 Maverick AI model ranks below competitors like GPT-4o and Claude 3.5 Sonnet in a popular chat benchmark, raising questions about benchmark optimization and model reliability.