by Anthropic

Claude Sonnet 4.6.

text closed 200K ctx Transformer Quality 78.9 Elo 1372

Cheapest input

$3.0/M

on Anthropic API

Cheapest output

$15.0/M

on Anthropic API

Hosted equiv.

~$5.4/hr

@ 100 tok/s on Anthropic API

Best price-performance from Anthropic. Default for production agents.

Hosted API only

No self-host path — closed weights.

Claude Sonnet 4.6's weights aren't published. Use it via the access providers below.

Where to use it Top 5 cheapest · 6 total below

Cheapest hosted endpoints.

Provider	Access	$/M in	$/M out
Anthropic API	api direct	$3.0	$15.0	Launch ↗
AWS Bedrock	api direct	$3.0	$15.0	Launch ↗
Google Vertex AI	api direct	$3.0	$15.0	Launch ↗
DeepInfra	hosted inference	$3.0	$15.0	Launch ↗
OpenRouter	api aggregator	$3.0	$15.0	Launch ↗

Capability snapshot Full benchmarks →

What it's best at.

Coding 91.5

General knowledge 86.0

Math 82.0

Reasoning 78.0

Sources

Official references.

Launch post ↗ Demo / Chat ↗ System card ↗ Docs ↗

Family

Variants in the Claude family.

Claude Opus 4.7

closed proprietary

Frontier reasoning and long-form coding from Anthropic.

Claude 3.5 Sonnet

closed proprietary

Anthropic's 3.5 generation — still in active production.

Claude Haiku 4.5

closed proprietary

Fast, cheap Claude variant for high-throughput inference.

Claude 3.5 Haiku

closed proprietary

Fast/cheap Claude 3.5 variant — production fallback for Haiku 4.5.

Best for

Workloads.

Run LLMs (inference / serving) General ML research and experiments

FAQ

Frequently asked.

How do I run Claude Sonnet 4.6?

Claude Sonnet 4.6 is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).

Where can I access Claude Sonnet 4.6?

Claude Sonnet 4.6 is available via Anthropic API, Poe by Quora, AWS Bedrock, Google Vertex AI, DeepInfra. Each access option lists its own pricing (per million tokens or hourly hosting).

How much does it cost to run Claude Sonnet 4.6?

API pricing starts at $3.0/M input tokens and $15.0/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.

Is Claude Sonnet 4.6 open-source or proprietary?

Claude Sonnet 4.6 is a proprietary model from Anthropic. Access is API-only — there are no public weights to download.

API pricing

What it costs per month across providers.

Estimate your monthly bill for Claude Sonnet 4.6 across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.

Cheapest

$60.0

Anthropic API

$/M input

$3.0

per million tokens

$/M output

$15.0

per million tokens

Providers

with priced rows

Monthly bill

Cheapest provider on the left.

Total monthly cost — input + output tokens combined.

Per provider

Bill breakdown.

Provider	$/M in	$/M out	Input cost	Output cost	Monthly total
Anthropic API	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗
AWS Bedrock	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗
Google Vertex AI	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗
OpenRouter	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗
DeepInfra	$3.0	$15.0	$30.0	$30.0	$60.0	Sign up ↗

Full calculator

Want to compare token volumes across other models too? Open the standalone API pricing tool →

Capability snapshot

What it's best at.

Coding 91.5

General knowledge 86.0

Math 82.0

Reasoning 78.0

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark	Score	Source
GPQA	59.0	official ↗
MATH	82.0	official ↗
MMLU	86.0	official ↗
MMLU-Pro	78.0	official ↗
HumanEval	91.5	official ↗
SWE-bench	77.2	official ↗

Leaderboard standing

Independent rankings.

LMSYS Chatbot Arena

1372

Rank #5 · Elo from blind head-to-head votes

View leaderboard ↗

Artificial Analysis Quality Index

79.0

Composite of reasoning + coding + tool-use benchmarks

View on Artificial Analysis ↗

Description

About Claude Sonnet 4.6.

Claude Sonnet 4.6 is Anthropic's price-performance flagship — fast enough for production traffic, smart enough to handle most agentic tasks Opus could. Released alongside Claude Code as the default model for software-engineering agents, Sonnet 4.6 leads the SWE-bench Verified leaderboard and powers most third-party coding tools. The 4.6 release brought the 1M-token context window option for long-document workflows (e.g. analysing entire codebases or legal contracts in one pass). Supports vision, tool use, function calling, and JSON mode. Cheaper than GPT-5 input-side; comparable output-side.

Architecture

How it's built.

Architecture

Transformer

Knowledge cutoff

Jul 2025

234 days from cutoff to release.

Context window

How much it can remember.

200K tokens ≈ 150,000 English words

4K 32K 128K 1M

Max output per call: 64K tokens

Capabilities

What it can do.

✓ Vision input

· Audio input

· Video input

✓ Function calling

✓ Tool use

✓ JSON mode

✓ Streaming

· Fine-tuning

All access providers

Every place this model is hosted.

Anthropic API

api direct

$3.0 / $15.0 per M (in/out)

Visit Anthropic API ↗

Poe by Quora

chat ui

Subscription includes Claude usage

AWS Bedrock

api direct

$3.0 / $15.0 per M (in/out)

Anthropic models on AWS Bedrock — same pricing as Anthropic direct + AWS infrastructure costs (storage / data transfer). Requires AWS account + Bedrock model access request.

Visit AWS Bedrock ↗

Google Vertex AI

api direct

$3.0 / $15.0 per M (in/out)

Anthropic models on Google Vertex AI — same pricing as Anthropic direct, billed via GCP. Requires Google Cloud project + Vertex AI Model Garden access.

Visit Google Vertex AI ↗

DeepInfra

hosted inference

$3.0 / $15.0 per M (in/out)

Visit DeepInfra ↗

OpenRouter

api aggregator

$3.0 / $15.0 per M (in/out)

Visit OpenRouter ↗