Best AI models for knowledge
MMLU Pro tests broad knowledge across academic and professional subjects with harder, reasoning-heavy questions than the original MMLU. A strong score indicates wide, reliable factual coverage.
- 🥇
gemini-3-pro-previewGoogle LLC (Gemini API)·$2.00 / $12.00 per 1M89.8%89.8% - 🥈claude-opus-4-5Anthropic PBC·$5.00 / $25.00 per 1M89.5%89.5%
- 🥉
gemini-3-flash-previewGoogle LLC (Gemini API)·$0.50 / $3.00 per 1M89.0%89.0% - 4
gpt-5.2-chatOpenAI Inc.·$1.75 / $14.00 per 1M87.4%87.4% - 5
gpt-5-chatOpenAI Inc.·$1.25 / $10.00 per 1M87.1%87.1% - 6
gpt-5.1OpenAI Inc.·$1.25 / $10.00 per 1M87.0%87.0% - 7grok-4xAI Corp.·$3.00 / $15.00 per 1M86.6%86.6%
- 8
gpt-5-codexOpenAI Responses·$1.25 / $10.00 per 1M86.5%86.5% - 9
gemini-2.5-proGoogle LLC (Gemini API)·$1.25 / $10.00 per 1M86.2%86.2% - 10
deepseek-v3.2Google LLC (Vertex AI)·$0.56 / $1.68 per 1M86.2%86.2% - 11
gpt-5.1-codexOpenAI Responses·$1.25 / $10.00 per 1M86.0%86.0% - 12
GLM-4.7Z AI·$0.60 / $2.20 per 1M85.6%85.6% - 13
o3OpenAI Inc.·$2.00 / $8.00 per 1M85.3%85.3% - 14grok-4-fastxAI Corp.·$0.20 / $0.50 per 1M85.0%85.0%
- 15
deepseek-r1-turboNovita AI·$0.70 / $2.50 per 1M84.9%84.9% - 16
kimi-k2Google LLC (Vertex AI)·$0.60 / $2.50 per 1M84.8%84.8% - 17
xiaomimimo/mimo-v2-flashNovita AI·$0.10 / $0.30 per 1M84.3%84.3% - 18
Qwen/Qwen3-235B-A22B-Instruct-2507DeepInfra Inc.·$0.07 / $0.10 per 1M84.3%84.3% - 19
o1OpenAI Inc.·$15.00 / $60.00 per 1M84.1%84.1% - 20
gpt-5-miniOpenAI Inc.·$0.25 / $2.00 per 1M83.7%83.7% - 21
deepseek-ai/DeepSeek-V3.1DeepInfra Inc.·$0.30 / $1.00 per 1M83.3%83.3% - 22
o4-miniOpenAI Inc.·$1.10 / $4.40 per 1M83.2%83.2% - 23
gemini-2.5-flashGoogle LLC (Gemini API)·$0.30 / $2.50 per 1M83.2%83.2% - 24
GLM-4.6Z AI·$0.60 / $2.20 per 1M82.9%82.9% - 25grok-3-minixAI Corp.·$0.30 / $0.50 per 1M82.8%82.8%
- 26MiniMax-M2MiniMax·$0.30 / $1.20 per 1M82.0%82.0%
- 27
deepseek-v3-0324Novita AI·$0.40 / $1.30 per 1M81.9%81.9% - 28
zai-org/GLM-4.5-AirDeepInfra Inc.·$0.20 / $1.10 per 1M81.5%81.5% - 29gpt-oss-120bGroq Inc.·$0.15 / $0.75 per 1M80.8%80.8%
- 30
gpt-4.1OpenAI Inc.·$2.00 / $8.00 per 1M80.6%80.6%
Explore other rankings
Smartest overall
Ranked by Intelligence Index
Best for coding
Ranked by Coding Index
Best coding agent
Ranked by Terminal-Bench Hard
Best for reasoning
Ranked by GPQA Diamond
Best at math
Ranked by Math Index
Best for tool use
Ranked by τ²-Bench
Cheapest
Lowest input + output price per 1M tokens
Longest context
Max tokens in a single prompt
How we rank
Scores for MMLU Pro come from Artificial Analysis, an independent AI benchmarking service. When a model is available through multiple providers (e.g. Anthropic direct, AWS Bedrock, Google Vertex), we show one canonical entry per model family so the ranking isn't polluted by duplicates. Benchmarks measure specific skills — always validate on your own workload before committing.
One API for every model on this list
Requesty is OpenAI-compatible and routes to 400+ models. Switch between any of the models above by changing one parameter in your code.
Get started free