by Google DeepMind

Google: Gemini 3.1 Flash Lite.

multimodal closed 1M ctx

Cheapest input

$0.25/M

on DeepInfra

Cheapest output

$1.5/M

on DeepInfra

Hosted equiv.

~$0.54/hr

@ 100 tok/s on DeepInfra

Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...

Hosted API only

No self-host path — closed weights.

Google: Gemini 3.1 Flash Lite's weights aren't published. Use it via the access providers below.

Where to use it

Cheapest hosted endpoints.

Provider	Access	$/M in	$/M out
DeepInfra	hosted inference	$0.25	$1.5	Launch ↗
OpenRouter	api aggregator	$0.25	$1.5	Launch ↗

FAQ

Frequently asked.

How do I run Google: Gemini 3.1 Flash Lite?

Google: Gemini 3.1 Flash Lite is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).

Where can I access Google: Gemini 3.1 Flash Lite?

Google: Gemini 3.1 Flash Lite is available via DeepInfra, OpenRouter. Each access option lists its own pricing (per million tokens or hourly hosting).

How much does it cost to run Google: Gemini 3.1 Flash Lite?

API pricing starts at $0.25/M input tokens and $1.5/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.

Is Google: Gemini 3.1 Flash Lite open-source or proprietary?

Google: Gemini 3.1 Flash Lite is a proprietary model from Google DeepMind. Access is API-only — there are no public weights to download.

API pricing

What it costs per month across providers.

Estimate your monthly bill for Google: Gemini 3.1 Flash Lite across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.

Cheapest

$5.5

OpenRouter

$/M input

$0.25

per million tokens

$/M output

$1.5

per million tokens

Providers

with priced rows

Monthly bill

Cheapest provider on the left.

Total monthly cost — input + output tokens combined.

Per provider

Bill breakdown.

Provider	$/M in	$/M out	Input cost	Output cost	Monthly total
OpenRouter	$0.25	$1.5	$2.5	$3.0	$5.5	Sign up ↗
DeepInfra	$0.25	$1.5	$2.5	$3.0	$5.5	Sign up ↗

Full calculator

Want to compare token volumes across other models too? Open the standalone API pricing tool →