by Google DeepMind

Google: Gemini 3.1 Pro Preview.

multimodal closed 1M ctx
Cheapest input
$2.0/M
on OpenRouter
Cheapest output
$12.0/M
on OpenRouter
Fastest
88 tok/s
on OpenRouter
Hosted equiv.
~$4.32/hr
@ 100 tok/s on OpenRouter

Gemini 3.1 Pro Preview is Google’s frontier reasoning model, delivering enhanced software engineering performance, improved agentic reliability, and more efficient token usage across complex workflows. Building on the mu

Hosted API only

No self-host path — closed weights.

Google: Gemini 3.1 Pro Preview's weights aren't published. Use it via the access providers below.

Where to use it

Cheapest hosted endpoints.

Provider Access $/M in $/M out
OpenRouter api aggregator $2.0 $12.0 Launch ↗
Performance

Speed across providers.

Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.

Provider Tokens/sec TTFT Total
OpenRouter 88.2 5298 ms 5627 ms
FAQ

Frequently asked.

How do I run Google: Gemini 3.1 Pro Preview?
Google: Gemini 3.1 Pro Preview is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).
Where can I access Google: Gemini 3.1 Pro Preview?
Google: Gemini 3.1 Pro Preview is available via OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).
How much does it cost to run Google: Gemini 3.1 Pro Preview?
API pricing starts at $2.0/M input tokens and $12.0/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.
Is Google: Gemini 3.1 Pro Preview open-source or proprietary?
Google: Gemini 3.1 Pro Preview is a proprietary model from Google DeepMind. Access is API-only — there are no public weights to download.