by Google DeepMind
Gemini 1.5 Flash.
multimodal
closed
1M ctx
Cheapest input
$0.075/M
on Google AI Studio
Cheapest output
$0.3/M
on Google AI Studio
Hosted equiv.
~$0.11/hr
@ 100 tok/s on Google AI Studio
Cheap fast Gemini — production default before 2.0/2.5 Flash.
Hosted API only
No self-host path — closed weights.
Gemini 1.5 Flash's weights aren't published. Use it via the access providers below.
Where to use it
Cheapest hosted endpoints.
| Provider | Access | $/M in | $/M out | |
|---|---|---|---|---|
| Google AI Studio | api direct | $0.075 | $0.3 |
Family
Variants in the Gemini family.
FAQ
Frequently asked.
How do I run Gemini 1.5 Flash?
Gemini 1.5 Flash is a closed-source API model. The cheapest way to access it is through the API providers listed on this page (direct API, aggregators, and hosted chat UIs).
Where can I access Gemini 1.5 Flash?
Gemini 1.5 Flash is available via Google AI Studio. Each access option lists its own pricing (per million tokens or hourly hosting).
How much does it cost to run Gemini 1.5 Flash?
API pricing starts at $0.075/M input tokens and $0.3/M output tokens. Self-hosting cost depends on the GPU you rent — see the Run It Yourself tab.
Is Gemini 1.5 Flash open-source or proprietary?
Gemini 1.5 Flash is a proprietary model from Google DeepMind. Access is API-only — there are no public weights to download.
API pricing
Per provider
What it costs per month across providers.
Estimate your monthly bill for Gemini 1.5 Flash across every host that publishes per-token pricing. Slide your token volumes; the chart + table re-rank cheapest-first.
Cheapest
$1.35
Google AI Studio
$/M input
$0.075
per million tokens
$/M output
$0.3
per million tokens
Providers
1
with priced rows
Monthly bill
Cheapest provider on the left.
Total monthly cost — input + output tokens combined.
Loading...
Bill breakdown.
| Provider | Monthly total | |
|---|---|---|
| $1.35 |
Full calculator
Want to compare token volumes across other models too?
Open the standalone API pricing tool →
Context window
How much it can remember.
1M tokens
≈ 750,000 English words
4K
32K
128K
1M
Max output per call: 8K tokens
Capabilities
What it can do.
✓
Vision input
✓
Audio input
✓
Video input
✓
Function calling
·
Tool use
✓
JSON mode
✓
Streaming
·
Fine-tuning