Claude 3.5 Haiku.
Fast/cheap Claude 3.5 variant — production fallback for Haiku 4.5.
No self-host path — closed weights.
Claude 3.5 Haiku's weights aren't published. Use it via the access providers below.
Cheapest hosted endpoints.
Speed across providers.
Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.
| Provider | Tokens/sec | TTFT | Total |
|---|---|---|---|
| OpenRouter | 28.8 | 1146 ms | 3964 ms |
Variants in the Claude family.
Frontier reasoning and long-form coding from Anthropic.
Best price-performance from Anthropic. Default for production agents.
Anthropic's 3.5 generation — still in active production.
Fast, cheap Claude variant for high-throughput inference.
Frequently asked.
How do I run Claude 3.5 Haiku?
Where can I access Claude 3.5 Haiku?
How much does it cost to run Claude 3.5 Haiku?
Is Claude 3.5 Haiku open-source or proprietary?
What it costs per month across providers.
Estimate your monthly bill for Claude 3.5 Haiku across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.
Cheapest provider on the left.
Total monthly cost — input + output tokens combined.