Google: Gemini 3.1 Flash Lite.
Gemini 3.1 Flash Lite is Google’s GA high-efficiency multimodal model optimized for low-latency, high-volume workloads. It supports text, image, video, audio, and PDF inputs, and is designed for lightweight agentic...
No self-host path — closed weights.
Google: Gemini 3.1 Flash Lite's weights aren't published. Use it via the access providers below.
Cheapest hosted endpoints.
Speed across providers.
Tokens/sec and time-to-first-token measured against the same prompt template on each provider's API.
| Provider | Tokens/sec | TTFT | Total |
|---|---|---|---|
| OpenRouter | 54.7 | 1302 ms | 1628 ms |
Frequently asked.
How do I run Google: Gemini 3.1 Flash Lite?
Where can I access Google: Gemini 3.1 Flash Lite?
How much does it cost to run Google: Gemini 3.1 Flash Lite?
Is Google: Gemini 3.1 Flash Lite open-source or proprietary?
What it costs per month across providers.
Estimate your monthly bill for Google: Gemini 3.1 Flash Lite across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.
Cheapest provider on the left.
Total monthly cost — input + output tokens combined.