by OpenAI

GPT-4 Turbo.

text closed 128K ctx Quality 86.4
Cheapest input
$10.0/M
on OpenAI API
Cheapest output
$30.0/M
on OpenAI API
Fastest
33 tok/s
on OpenRouter
Hosted equiv.
~$10.8/hr
@ 100 tok/s on OpenAI API

OpenAI's pre-GPT-5 flagship — still extensively deployed.

Hosted API only

No self-host path — closed weights.

GPT-4 Turbo's weights aren't published. Use it via the access providers below.

Where to use it

Cheapest hosted endpoints.

Provider Access $/M in $/M out
OpenAI API api direct $10.0 $30.0 Launch ↗
Azure OpenAI Service api direct $10.0 $30.0 Launch ↗
OpenRouter api aggregator $10.0 $30.0 Launch ↗
Capability snapshot Full benchmarks →

What it's best at.

General knowledge 86.4
Performance

Speed across providers.

Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.

Provider Tokens/sec TTFT Total
OpenRouter 32.5 874 ms 4056 ms
Sources

Official references.

Best for

Workloads.

FAQ

Frequently asked.

How do I run GPT-4 Turbo?
GPT-4 Turbo is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).
Where can I access GPT-4 Turbo?
GPT-4 Turbo is available via OpenAI API, Azure OpenAI Service, OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).
How much does it cost to run GPT-4 Turbo?
API pricing starts at $10.0/M input tokens and $30.0/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.
Is GPT-4 Turbo open-source or proprietary?
GPT-4 Turbo is a proprietary model from OpenAI. Access is API-only — there are no public weights to download.