by DeepSeek

DeepSeek R1.

text open weights datacenter 671B params 37B active 128K ctx MoE Quality 82.7
Cheapest input
$0.55/M
on DeepSeek API
Cheapest output
$2.19/M
on DeepSeek API
Fastest
22 tok/s
on OpenRouter
Smallest GPU
2× AMD MI325
Capability snapshot

What it's best at.

Math 97.3
Coding 90.0
Reasoning 84.0
Graduate-level science 71.5

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
GPQA 71.5 official ↗
MATH 97.3 official ↗
MMLU-Pro 84.0 official ↗
HumanEval 90.0 official ↗
Leaderboard standing

Independent rankings.

Artificial Analysis Quality Index
72.0
Composite of reasoning + coding + tool-use benchmarks
View on Artificial Analysis ↗
Description

About DeepSeek R1.

DeepSeek R1 is DeepSeek AI's reasoning model — trained primarily with reinforcement learning (no SFT bootstrap) on top of DeepSeek V3. Pioneers thinking aloud style chain-of-thought; rivals OpenAI's o1 on math and coding reasoning benchmarks. The January 2025 launch sent shockwaves through the AI industry because the model was released under the unrestricted MIT license, with the training methodology fully documented. Also drove the famous DeepSeek shock sell-off of US tech stocks. 671B/37B MoE under the hood; quantised distilled versions (1.5B to 70B) also released and widely deployed for cost-sensitive reasoning workloads.

Architecture

How it's built.

Architecture
MoE
Mixture of Experts — 37B params active per token out of 671B total.
Knowledge cutoff
Jul 2024
203 days from cutoff to release.
Context window

How much it can remember.

128K tokens ≈ 96,000 English words
4K 32K 128K 1M
Max output per call: 33K tokens
Capabilities

What it can do.

· Vision input
· Audio input
· Video input
Function calling
· Tool use
· JSON mode
Streaming
Fine-tuning