by Anthropic

Claude Haiku 4.5.

text closed 200K ctx Transformer Quality 68.1 Elo 1287
Cheapest input
$1.0/M
on Anthropic API
Cheapest output
$5.0/M
on Anthropic API
Fastest
39 tok/s
on OpenRouter
Hosted equiv.
~$1.8/hr
@ 100 tok/s on Anthropic API

Fast, cheap Claude variant for high-throughput inference.

Hosted API only

No self-host path — closed weights.

Claude Haiku 4.5's weights aren't published. Use it via the access providers below.

Where to use it

Cheapest hosted endpoints.

Provider Access $/M in $/M out
Anthropic API api direct $1.0 $5.0 Launch ↗
DeepInfra hosted inference $1.0 $5.0 Launch ↗
OpenRouter api aggregator $1.0 $5.0 Launch ↗
Capability snapshot Full benchmarks →

What it's best at.

Coding 83.0
General knowledge 78.0
Graduate-level science 42.0
Performance

Speed across providers.

Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.

Provider Tokens/sec TTFT Total
OpenRouter 39.5 3388 ms 4688 ms
Sources

Official references.

FAQ

Frequently asked.

How do I run Claude Haiku 4.5?
Claude Haiku 4.5 is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).
Where can I access Claude Haiku 4.5?
Claude Haiku 4.5 is available via Anthropic API, DeepInfra, OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).
How much does it cost to run Claude Haiku 4.5?
API pricing starts at $1.0/M input tokens and $5.0/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.
Is Claude Haiku 4.5 open-source or proprietary?
Claude Haiku 4.5 is a proprietary model from Anthropic. Access is API-only — there are no public weights to download.