by Anthropic

Claude Sonnet 4.6.

text closed 200K ctx Transformer Quality 78.9 Elo 1372
Cheapest input
$3.0/M
on Anthropic API
Cheapest output
$15.0/M
on Anthropic API
Fastest
37 tok/s
on OpenRouter
Hosted equiv.
~$5.4/hr
@ 100 tok/s on Anthropic API
Capability snapshot

What it's best at.

Coding 91.5
General knowledge 86.0
Math 82.0
Reasoning 78.0

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
GPQA 59.0 official ↗
MATH 82.0 official ↗
MMLU 86.0 official ↗
MMLU-Pro 78.0 official ↗
HumanEval 91.5 official ↗
SWE-bench 77.2 official ↗
Leaderboard standing

Independent rankings.

LMSYS Chatbot Arena
1372
Rank #5 · Elo from blind head-to-head votes
View leaderboard ↗
Artificial Analysis Quality Index
79.0
Composite of reasoning + coding + tool-use benchmarks
View on Artificial Analysis ↗
Description

About Claude Sonnet 4.6.

Claude Sonnet 4.6 is Anthropic's price-performance flagship — fast enough for production traffic, smart enough to handle most agentic tasks Opus could. Released alongside Claude Code as the default model for software-engineering agents, Sonnet 4.6 leads the SWE-bench Verified leaderboard and powers most third-party coding tools. The 4.6 release brought the 1M-token context window option for long-document workflows (e.g. analysing entire codebases or legal contracts in one pass). Supports vision, tool use, function calling, and JSON mode. Cheaper than GPT-5 input-side; comparable output-side.

Architecture

How it's built.

Architecture
Transformer
Knowledge cutoff
Jul 2025
234 days from cutoff to release.
Context window

How much it can remember.

200K tokens ≈ 150,000 English words
4K 32K 128K 1M
Max output per call: 64K tokens
Capabilities

What it can do.

Vision input
· Audio input
· Video input
Function calling
Tool use
JSON mode
Streaming
· Fine-tuning
All access providers

Every place this model is hosted.