by Anthropic

Claude Haiku 4.5.

Text, closed, 200K context, Transformer, Quality 68.1, Elo 1287
Cheapest input: $1.0/M tokens on Anthropic API
Cheapest output: $5.0/M tokens on Anthropic API
Fastest: 39 tok/s on OpenRouter
Hosted equiv.: ~$1.8/hr @ 100 tok/s on Anthropic API
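
The hosted-equivalent figure appears to follow from the output price and the stated throughput. The sketch below reproduces that arithmetic, assuming (the page does not say so) that only output tokens are billed and that generation runs continuously at 100 tok/s for the full hour. The function name is illustrative.

```python
def hosted_cost_per_hour(tokens_per_second: float, usd_per_million_tokens: float) -> float:
    """Hourly cost of generating tokens continuously at a fixed rate."""
    tokens_per_hour = tokens_per_second * 3600  # seconds in one hour
    return tokens_per_hour / 1_000_000 * usd_per_million_tokens


# Assumption: output tokens only, at the listed $5.0/M output price.
cost = hosted_cost_per_hour(tokens_per_second=100, usd_per_million_tokens=5.0)
print(f"~${cost:.1f}/hr")
```

At 100 tok/s this comes to 360,000 tokens per hour, which is consistent with the ~$1.8/hr shown on the card.
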
Capability snapshot

What it's best at.

Coding 83.0
General knowledge 78.0
Graduate-level science 42.0

Scores are normalised against benchmark ceilings (100 = perfect) and coloured by tier: coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below 50.
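
The tier thresholds in the note above can be read as a simple lookup. This is a minimal sketch, assuming the bands are inclusive at their lower bound. The function name and the "below solid" label are illustrative and do not come from the page.

```python
def tier(score: float) -> str:
    """Map a normalised score (0-100) to the tier names used in the note."""
    if score >= 80:
        return "frontier"  # coral
    if score >= 65:
        return "strong"    # lavender
    if score >= 50:
        return "solid"     # sage
    return "below solid"   # slate (label is an assumption)


# The three capability scores listed in the snapshot.
snapshot = {
    "Coding": 83.0,
    "General knowledge": 78.0,
    "Graduate-level science": 42.0,
}
for name, score in snapshot.items():
    print(f"{name}: {score} -> {tier(score)}")
```
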

Benchmarks

Published scores.

Benchmark   Score   Source
GPQA        42.0    official
MMLU        78.0    official
HumanEval   83.0    official
Leaderboard standing

Independent rankings.

LMSYS Chatbot Arena: 1287 (Elo from blind head-to-head votes)
Artificial Analysis Quality Index: 65.0 (composite of reasoning + coding + tool-use benchmarks)
Description

About Claude Haiku 4.5.

Claude Haiku 4.5 is Anthropic's fastest and cheapest model, built for high-throughput inference workloads (chatbots, classification, summarisation, autocomplete) where latency and cost dominate. Quality is roughly two tiers below Sonnet but still strong on shorter prompts. It has the same 200K context window as the rest of Claude 4.x, the same vision support, and the same tool-use API. Amazon Bedrock and Google Cloud Vertex AI also offer the model under enterprise contracts. Most teams use Haiku for the cost-sensitive layer of a pipeline, for example answering most prompts with Haiku and routing to Sonnet only when needed.
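
The routing pattern mentioned in the description might look like the sketch below. Everything here is hypothetical: the "haiku" and "sonnet" strings are plain labels, not real API model identifiers, and the escalation heuristic (prompt length plus a keyword check) is only a placeholder for whatever signal a real pipeline would use. No API calls are made.

```python
from typing import Callable


def route(prompt: str, needs_escalation: Callable[[str], bool]) -> str:
    """Return the label of the tier that should handle this prompt."""
    # Default to the cheap tier; escalate only when the predicate fires.
    return "sonnet" if needs_escalation(prompt) else "haiku"


def long_or_multistep(prompt: str) -> bool:
    """Placeholder heuristic: escalate long or explicitly multi-step prompts."""
    word_count = len(prompt.split())
    return word_count > 400 or "step by step" in prompt.lower()


print(route("Classify this ticket: printer jam", long_or_multistep))
print(route("Work through this proof step by step.", long_or_multistep))
```

Passing the predicate in as an argument keeps the escalation policy separate from the dispatch logic, so it can be swapped for a classifier or a confidence score without touching `route`.
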

Architecture

How it's built.

Architecture: Transformer
Knowledge cutoff: Apr 2025 (183 days from cutoff to release)
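
The 183-day gap is a plain date difference. The page gives neither exact date, so both dates in this sketch are assumptions: the cutoff is pinned to 15 April 2025 (the page only says "Apr 2025") and the release to 15 October 2025.

```python
from datetime import date

# Both dates are assumptions; the listing states only "Apr 2025" and "183 days".
assumed_cutoff = date(2025, 4, 15)
assumed_release = date(2025, 10, 15)

gap_days = (assumed_release - assumed_cutoff).days
print(f"{gap_days} days from cutoff to release")
```
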
Context window

How much it can remember.

200K tokens ≈ 150,000 English words
Max output per call: 8K tokens
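
The word estimate above implies a ratio of about 0.75 English words per token (150,000 / 200,000). The sketch below applies that ratio and checks whether a prompt leaves room for a full-length reply. It assumes that output tokens count against the same 200K window, and the helper names are illustrative.

```python
CONTEXT_WINDOW = 200_000   # tokens, from the listing
MAX_OUTPUT = 8_000         # tokens per call, from the listing
WORDS_PER_TOKEN = 150_000 / 200_000  # ratio implied by the listing (0.75)


def approx_words(tokens: int) -> int:
    """Rough English word count for a given token count."""
    return round(tokens * WORDS_PER_TOKEN)


def fits_in_context(prompt_tokens: int, reserved_output: int = MAX_OUTPUT) -> bool:
    """True if the prompt plus reserved output stays inside the window.

    Assumption: input and output share the same context window.
    """
    return prompt_tokens + reserved_output <= CONTEXT_WINDOW


print(approx_words(CONTEXT_WINDOW))  # words for a full window
print(approx_words(MAX_OUTPUT))      # words for a maximum-length reply
print(fits_in_context(195_000))      # too large once output is reserved
```
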
Capabilities

What it can do.

Supported: Vision input, Function calling, Tool use, JSON mode, Streaming
Not supported: Audio input, Video input, Fine-tuning