Claude 3.5 Sonnet.
Anthropic's 3.5 generation — still in active production.
No self-host path — closed weights.
Claude 3.5 Sonnet's weights aren't published. Use it via the access providers below.
Cheapest hosted endpoints.
Speed across providers.
Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.
| Provider | Tokens/sec | Time to first token | Total time |
|---|---|---|---|
| OpenRouter | 32.6 | 1376 ms | 4725 ms |
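
As a rough illustration of how figures like these could be collected, here is a minimal sketch that times any streamed response. The page does not say how its numbers were computed, so several things here are assumptions: the function name `measure_stream` is hypothetical, each streamed chunk is counted as one token, and throughput is taken over the decode phase only (after the first token arrives).

```python
import time
from typing import Callable, Dict, Iterable


def measure_stream(
    chunks: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Dict[str, float]:
    """Time a token stream and report TTFT, total time, and tokens/sec.

    Assumption: every item yielded by `chunks` counts as one token.
    """
    start = clock()
    first_token_at = None
    count = 0
    for _ in chunks:
        now = clock()
        if first_token_at is None:
            first_token_at = now  # arrival of the first token
        count += 1
    end = clock()

    if first_token_at is None:
        raise ValueError("stream produced no tokens")

    decode_seconds = end - first_token_at
    # Assumed convention: throughput covers tokens after the first one.
    if decode_seconds > 0:
        tokens_per_sec = (count - 1) / decode_seconds
    else:
        tokens_per_sec = float("nan")

    return {
        "ttft_ms": (first_token_at - start) * 1000.0,
        "total_ms": (end - start) * 1000.0,
        "tokens_per_sec": tokens_per_sec,
    }
```

In practice `chunks` would be the streaming iterator returned by a provider's client. The `clock` parameter exists only so the timing logic can be checked with a fake clock.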
Variants in the Claude family.
Frontier reasoning and long-form coding from Anthropic.
Best price-performance from Anthropic. Default for production agents.
Fast, cheap Claude variant for high-throughput inference.
Fast/cheap Claude 3.5 variant — production fallback for Haiku 4.5.
Frequently asked.
How do I run Claude 3.5 Sonnet?
Where can I access Claude 3.5 Sonnet?
How much does it cost to run Claude 3.5 Sonnet?
Is Claude 3.5 Sonnet open-source or proprietary?
What it costs per month across providers.
Estimate your monthly bill for Claude 3.5 Sonnet across every host that publishes per-token pricing. Adjust your input and output token volumes; the chart and table re-rank providers cheapest-first.
Chart: total monthly cost per provider (input plus output tokens combined), cheapest provider on the left.
Bill breakdown.
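The monthly estimate described above is simple arithmetic: input and output tokens are each multiplied by their per-token price and summed. The sketch below shows one way to compute the breakdown and rank providers cheapest-first. The provider names and prices are made-up placeholders, not real quotes, and the per-million-token pricing unit is an assumption.

```python
from typing import Dict, List, Tuple


def monthly_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_mtok: float,
    output_price_per_mtok: float,
) -> Dict[str, float]:
    """Return the input, output, and total cost for one month of usage.

    Prices are assumed to be quoted in dollars per million tokens.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_mtok
    output_cost = output_tokens / 1_000_000 * output_price_per_mtok
    return {
        "input": input_cost,
        "output": output_cost,
        "total": input_cost + output_cost,
    }


def rank_cheapest_first(
    prices: Dict[str, Tuple[float, float]],
    input_tokens: int,
    output_tokens: int,
) -> List[Tuple[str, float]]:
    """Sort providers by total monthly cost, cheapest first."""
    totals = [
        (name, monthly_cost(input_tokens, output_tokens, inp, out)["total"])
        for name, (inp, out) in prices.items()
    ]
    return sorted(totals, key=lambda item: item[1])


# Hypothetical prices (input, output) in dollars per million tokens.
EXAMPLE_PRICES = {
    "provider_a": (3.0, 15.0),
    "provider_b": (2.5, 16.0),
}

ranking = rank_cheapest_first(EXAMPLE_PRICES, 10_000_000, 2_000_000)
```

Because output tokens are usually priced higher than input tokens, the ranking can change with the input-to-output ratio, which is why both volumes are separate parameters.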
What it's best at.
Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.
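The tier thresholds above map directly onto a small lookup. The sketch below assumes that normalisation is a linear rescale of the raw score against the benchmark ceiling, which the page does not spell out. The function names and the label "below" for the slate tier are illustrative choices.

```python
from typing import Tuple


def normalise(raw_score: float, ceiling: float) -> float:
    """Rescale a raw benchmark score so that the ceiling maps to 100.

    Assumption: a plain linear rescale.
    """
    if ceiling <= 0:
        raise ValueError("ceiling must be positive")
    return raw_score / ceiling * 100.0


def tier(score: float) -> Tuple[str, str]:
    """Map a normalised score to its (tier label, colour) pair."""
    if score >= 80:
        return ("frontier", "coral")
    if score >= 65:
        return ("strong", "lavender")
    if score >= 50:
        return ("solid", "sage")
    return ("below", "slate")  # "below" is a placeholder label
```

For example, a raw score of 45 on a benchmark with a ceiling of 50 normalises to 90 and falls in the coral tier.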
About Claude 3.5 Sonnet.
Claude 3.5 Sonnet (October 2024 revision) was Anthropic's flagship coding model before Sonnet 4.6. It is still widely deployed in production, and many teams pin to 3.5 for reproducibility. It has the same 200K context, vision support, and tool-use API as the 4.x series, and it is cheaper than Sonnet 4.6.