AI Model Benchmark Dashboard
Interactive comparison of leading foundation models across key benchmarks — Updated July 2025
Visual Explorer
Full Score Table
Benchmark |
Claude Opus 4 |
Claude Sonnet 4 |
Claude Sonnet 3.7 |
OpenAI o3 |
GPT-4 Turbo |
GPT-4.1 |
Gemini 2.5 Pro |
Llama 3 70B |
Mistral Large |
Sources: public benchmark leaderboards & vendor disclosures, aggregated July 9 2025.