by Anthropic

Claude 3.5 Sonnet.

text closed 200K ctx Transformer Quality 90.8

Cheapest input

$3.0/M

on Anthropic API

Cheapest output

$15.0/M

on Anthropic API

Hosted equiv.

~$5.4/hr

@ 100 tok/s on Anthropic API

Anthropic's 3.5 generation — still in active production.

Hosted API only

No self-host path — closed weights.

Claude 3.5 Sonnet's weights aren't published. Use it via the access providers below.

Where to use it

Cheapest hosted endpoints.

Provider	Access	$/M in	$/M out
Anthropic API	api direct	$3.0	$15.0	Launch ↗
AWS Bedrock	api aggregator	$3.0	$15.0
Google Vertex AI	api direct	$3.0	$15.0	Launch ↗

Capability snapshot Full benchmarks →

What it's best at.

Coding 92.0

General knowledge 88.7

Sources

Official references.

Launch post ↗ Demo / Chat ↗ Docs ↗

Family

Variants in the Claude family.

Claude Opus 4.7

closed proprietary

Frontier reasoning and long-form coding from Anthropic.

Claude Sonnet 4.6

closed proprietary

Best price-performance from Anthropic. Default for production agents.

Claude Haiku 4.5

closed proprietary

Fast, cheap Claude variant for high-throughput inference.

Claude 3.5 Haiku

closed proprietary

Fast/cheap Claude 3.5 variant — production fallback for Haiku 4.5.

Best for

Workloads.

Run LLMs (inference / serving)

FAQ

Frequently asked.

How do I run Claude 3.5 Sonnet?

Claude 3.5 Sonnet is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).

Where can I access Claude 3.5 Sonnet?

Claude 3.5 Sonnet is available via Anthropic API, AWS Bedrock, Google Vertex AI, OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).

How much does it cost to run Claude 3.5 Sonnet?

API pricing starts at $3.0/M input tokens and $15.0/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.

Is Claude 3.5 Sonnet open-source or proprietary?

Claude 3.5 Sonnet is a proprietary model from Anthropic. Access is API-only — there are no public weights to download.

API pricing

What it costs per month across providers.

Estimate your monthly bill for Claude 3.5 Sonnet across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.

Cheapest

$60.0

Anthropic API

$/M input

$3.0

per million tokens

$/M output

$15.0

per million tokens

Providers

with priced rows

Monthly bill

Cheapest provider on the left.

Total monthly cost — input + output tokens combined.

Per provider

Bill breakdown.

Provider	$/M in	$/M out	Input cost	Output cost	Monthly total
Anthropic API	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗
AWS Bedrock	$3.0	$15.0	$30.0	$30.0	$60.0
Google Vertex AI	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗

Full calculator

Want to compare token volumes across other models too? Open the standalone API pricing tool →

Capability snapshot

What it's best at.

Coding 92.0

General knowledge 88.7

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark	Score	Source
MMLU	88.7	official ↗
HumanEval	92.0	official ↗

Description

About Claude 3.5 Sonnet.

Claude 3.5 Sonnet (October 2024 revision) was Anthropic's flagship coding model before Sonnet 4.6. Still extensively deployed in production — many teams pin to 3.5 for reproducibility. Same 200K context, same vision support, same tool-use API as the 4.x series. Cheaper than Sonnet 4.6.

Architecture

How it's built.

Architecture

Transformer

Knowledge cutoff

Apr 2024

80 days from cutoff to release.

Context window

How much it can remember.

200K tokens ≈ 150,000 English words

4K 32K 128K 1M

Max output per call: 8K tokens

Capabilities

What it can do.

✓ Vision input

· Audio input

· Video input

✓ Function calling

✓ Tool use

· JSON mode

✓ Streaming

· Fine-tuning