by OpenAI

GPT-4o Mini.

multimodal closed 128K ctx Quality 85.3
Cheapest input
$0.15/M
on OpenAI API
Cheapest output
$0.6/M
on OpenAI API
Fastest
35 tok/s
on OpenRouter
Hosted equiv.
~$0.22/hr
@ 100 tok/s on OpenAI API
Capability snapshot

What it's best at.

Coding 87.2
General knowledge 82.0

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
MMLU 82.0 official ↗
HumanEval 87.2 official ↗
Description

About GPT-4o Mini.

GPT-4o Mini is OpenAI's cost-sensitive multimodal model — drop-in replacement for GPT-3.5 Turbo at half the price. Supports vision, JSON mode, function calling. 128K context. Most teams use it for high-volume classification, routing, autocomplete, and other cost-dominated workloads where Sonnet/Opus/GPT-5 would be overkill.

Context window

How much it can remember.

128K tokens ≈ 96,000 English words
4K 32K 128K 1M
Max output per call: 16K tokens
Capabilities

What it can do.

Vision input
· Audio input
· Video input
Function calling
· Tool use
JSON mode
Streaming
· Fine-tuning