MMLU and MMLU-Pro combined.

Best AI models for general knowledge.

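MMLU measures breadth of knowledge across 57 academic subjects; MMLU-Pro raises the bar on overlapping domains with harder questions and ten answer options instead of four. A high combined score means the model has broad factual knowledge to draw on before it has to reason.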

Benchmarks used: MMLU (50%) · MMLU-Pro (50%)

Showing top 25 models with published data on at least one of the benchmarks above. Scores are weighted averages on a 0–100 scale.
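
As a rough illustration of how a 50/50 weighted average could be computed, here is a minimal Python sketch. The function name `combined_score` and the example scores are hypothetical. The handling of a model with only one published benchmark (renormalizing the weights over whatever scores are available) is an assumption, since the page does not say how missing scores are treated.

```python
from typing import Mapping, Optional

# Weights as stated on the page: each benchmark contributes 50%.
WEIGHTS = {"MMLU": 0.5, "MMLU-Pro": 0.5}


def combined_score(
    scores: Mapping[str, Optional[float]],
    weights: Mapping[str, float] = WEIGHTS,
) -> Optional[float]:
    """Weighted average of benchmark scores on a 0-100 scale.

    ASSUMPTION: when a benchmark score is missing (None or absent),
    the remaining weights are renormalized so they sum to 1. The
    leaderboard's actual rule for missing data is not documented here.
    """
    # Keep only benchmarks that have both a weight and a published score.
    available = {
        name: value
        for name, value in scores.items()
        if name in weights and value is not None
    }
    if not available:
        return None  # no published data on any listed benchmark

    total_weight = sum(weights[name] for name in available)
    weighted_sum = sum(weights[name] * value for name, value in available.items())
    return weighted_sum / total_weight


# Hypothetical scores, for illustration only.
both = combined_score({"MMLU": 88.0, "MMLU-Pro": 72.0})
only_mmlu = combined_score({"MMLU": 90.0, "MMLU-Pro": None})
```

Under this assumption, a model with a score on only one benchmark is ranked by that score alone, which makes its combined number not directly comparable with models that report both.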
