by Alibaba (Qwen Team)

Qwen 2.5 72B.

text open weights workstation 73B params 128K ctx Transformer Quality 73.0
Cheapest input
$0.36/M
on Alibaba DashScope
Cheapest output
$0.4/M
on Alibaba DashScope
Fastest
26 tok/s
on OpenRouter
Smallest GPU
1× Nvidia RTX PRO 5000 Blackwell
Capability snapshot

What it's best at.

Coding 86.6
General knowledge 86.1
Math 83.1
Reasoning 71.1

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
GPQA 49.0 official ↗
MATH 83.1 official ↗
MMLU 86.1 official ↗
MMLU-Pro 71.1 official ↗
HumanEval 86.6 official ↗
Leaderboard standing

Independent rankings.

Artificial Analysis Quality Index
56.0
Composite of reasoning + coding + tool-use benchmarks
View on Artificial Analysis ↗
Description

About Qwen 2.5 72B.

Qwen 2.5 72B is Alibaba's flagship open-weight model in the 2.5 generation (released September 2024). Strong on multilingual tasks (29 languages with deep optimisation for Chinese, Japanese, Korean) and competitive with Llama 3.1 70B on most English benchmarks. 128K context. Available on Hugging Face, ModelScope, and via Alibaba Cloud's DashScope API. License permits commercial use with attribution; restrictions on military and high-risk applications.

Architecture

How it's built.

Architecture
Transformer
Trained on
18.0T tokens
248 tokens per parameter — well above the Chinchilla optimum.
Knowledge cutoff
Apr 2024
171 days from cutoff to release.
Context window

How much it can remember.

128K tokens ≈ 96,000 English words
4K 32K 128K 1M
Max output per call: 8K tokens
Capabilities

What it can do.

· Vision input
· Audio input
· Video input
Function calling
Tool use
JSON mode
Streaming
Fine-tuning