by OpenAI

GPT-4o Mini.

multimodal closed 128K ctx Quality 85.3

Cheapest input

$0.15/M

on OpenAI API

Cheapest output

$0.6/M

on OpenAI API

Hosted equiv.

~$0.22/hr

@ 100 tok/s on OpenAI API

Cheap multimodal default — replaced GPT-3.5 Turbo for low-cost workloads.

Hosted API only

No self-host path — closed weights.

GPT-4o Mini's weights aren't published. Use it via the access providers below.

Where to use it

Cheapest hosted endpoints.

Provider	Access	$/M in	$/M out
OpenAI API	api direct	$0.15	$0.6	Launch ↗
Azure OpenAI Service	api direct	$0.15	$0.6	Launch ↗
OpenRouter	api aggregator	$0.15	$0.6	Launch ↗

Capability snapshot Full benchmarks →

What it's best at.

Coding 87.2

General knowledge 82.0

Sources

Official references.

Launch post ↗

Family

Variants in the GPT family.

GPT-5

closed proprietary

OpenAI's frontier multimodal reasoning model.

GPT-4 Turbo

closed proprietary

OpenAI's pre-GPT-5 flagship — still extensively deployed.

GPT-4o

closed proprietary

OpenAI's multimodal model — text, vision, audio in one.

Best for

Workloads.

Run LLMs (inference / serving)

FAQ

Frequently asked.

How do I run GPT-4o Mini?

GPT-4o Mini is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).

Where can I access GPT-4o Mini?

GPT-4o Mini is available via OpenAI API, Azure OpenAI Service, OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).

How much does it cost to run GPT-4o Mini?

API pricing starts at $0.15/M input tokens and $0.6/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.

Is GPT-4o Mini open-source or proprietary?

GPT-4o Mini is a proprietary model from OpenAI. Access is API-only — there are no public weights to download.

API pricing

What it costs per month across providers.

Estimate your monthly bill for GPT-4o Mini across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.

Cheapest

$2.7

OpenAI API

$/M input

$0.15

per million tokens

$/M output

$0.6

per million tokens

Providers

with priced rows

Monthly bill

Cheapest provider on the left.

Total monthly cost — input + output tokens combined.

Per provider

Bill breakdown.

Provider	$/M in	$/M out	Input cost	Output cost	Monthly total
OpenAI API	$0.15	$0.6	$1.5	$1.2	$2.7	Sign up ↗
Azure OpenAI Service	$0.15	$0.6	$1.5	$1.2	$2.7	Sign up ↗
OpenRouter	$0.15	$0.6	$1.5	$1.2	$2.7	Sign up ↗

Full calculator

Want to compare token volumes across other models too? Open the standalone API pricing tool →

Capability snapshot

What it's best at.

Coding 87.2

General knowledge 82.0

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark	Score	Source
MMLU	82.0	official ↗
HumanEval	87.2	official ↗

Description

About GPT-4o Mini.

GPT-4o Mini is OpenAI's cost-sensitive multimodal model — drop-in replacement for GPT-3.5 Turbo at half the price. Supports vision, JSON mode, function calling. 128K context. Most teams use it for high-volume classification, routing, autocomplete, and other cost-dominated workloads where Sonnet/Opus/GPT-5 would be overkill.

Context window

How much it can remember.

128K tokens ≈ 96,000 English words

4K 32K 128K 1M

Max output per call: 16K tokens

Capabilities

What it can do.

✓ Vision input

· Audio input

· Video input

✓ Function calling

· Tool use

✓ JSON mode

✓ Streaming

· Fine-tuning