Ai Benchmarks

Tech Crunch - Apr 20th, 2025
Score 6.8

OpenAI’s o3 AI model scores lower on a benchmark than the company initially implied

OpenAI's o3 benchmark results spark transparency debate in AI community

Forbes - Apr 13th, 2025
Score 6.8

Beyond The Llama Drama: 4 New Benchmarks For Large Language Models

Llama 4 controversy highlights flaws in AI benchmark evaluations

Tech Crunch - Apr 11th, 2025
Score 7.0

Meta’s vanilla Maverick AI model ranks below rivals on a popular chat benchmark

Meta's Llama 4 Maverick struggles after AI benchmark controversy.

Tech Crunch - Apr 9th, 2025
Score 7.2

OpenAI launches program to design new ‘domain-specific’ AI benchmarks

OpenAI launches Pioneers Program to revamp broken AI benchmarks.

The Verge - Apr 8th, 2025
Score 6.8

Meta got caught gaming AI benchmarks

Meta's Maverick AI scores high but raises benchmark fairness issues.

Tech Crunch - Mar 30th, 2025
Score 5.0

The hottest AI models, what they do, and how to use them

AI Models Flood Market: From Google's Gemini to OpenAI's Orion

Tech Crunch - Mar 25th, 2025
Score 7.6

A new, challenging AGI test stumps most AI models

ARC-AGI-2 stumps AI models; new benchmark challenges AI intelligence limits.

Previous Next

Showing 1 to 7 of 7 results