by DeepSeek

DeepSeek V3.

text open weights datacenter 671B params 37B active 128K ctx MoE Quality 76.1
Cheapest input
$0.27/M
on DeepSeek API
Cheapest output
$0.89/M
on DeepInfra
Fastest
30 tok/s
on OpenRouter
Smallest GPU
2× AMD MI325

DeepSeek's flagship MoE — 671B total, 37B active, frontier-class.

Smallest GPU to run it See all quantisations →

2× AMD MI325.

Most-aggressive quantisation we have a working recommendation for. Lower precision = less VRAM = cheaper hardware, at a small accuracy cost.

Where to use it Top 2 cheapest · 7 total below

Cheapest hosted endpoints.

Provider Access $/M in $/M out
DeepSeek API api direct $0.27 $1.1 Launch ↗
DeepInfra hosted inference $0.32 $0.89 Launch ↗
Capability snapshot Full benchmarks →

What it's best at.

Math 90.2
General knowledge 88.5
Coding 82.6
Reasoning 75.9
Performance

Speed across providers.

Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.

Provider Tokens/sec TTFT Total
OpenRouter 30.3 1207 ms 4324 ms
FAQ

Frequently asked.

How do I run DeepSeek V3?
DeepSeek V3 is open-weight, so you can self-host on rented GPUs. See the Run It Yourself tab for GPU configurations + cost estimates, or use one of the hosted inference providers listed on this page.
Where can I access DeepSeek V3?
DeepSeek V3 is available via Self-hosted on rented GPU cluster, DeepSeek API, Together AI, Fireworks AI, OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).
How much does it cost to run DeepSeek V3?
API pricing starts at $0.27/M input tokens and $1.1/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.
Is DeepSeek V3 open-source or proprietary?
DeepSeek V3 is open-weight under the DeepSeek License (commercial) license. You can download and self-host it.