Hyperscaler gateways
US
Google Vertex AI.
GCP's managed model platform — Gemini, Claude, Llama under Google IAM + the rest of the GCP stack.
Cheapest 12 models
Where the floor is.
Sorted cheapest-first by $/M input. Useful when you're looking for the floor before picking a model.
Loading...
At a glance
- Service type
- Hyperscaler gateways
- Trust tier
- Tier 1
- Headquarters
- US
- OpenAI-compat
- No
- Open weights
- Yes
- Proprietary
- Yes
When to pick Google Vertex AI
Best for
- Existing AWS / GCP / Azure customers — same IAM, same VPC, same billing.
- Regulated workloads requiring the hyperscaler's compliance frameworks.
- Multi-region production deployments tightly coupled to other cloud services.
Avoid for
- Cost-sensitive workloads — hyperscaler markup over first-party is real.
- Anyone who doesn't already need the surrounding cloud platform.
Models on Google Vertex AI
Pricing + measured speed + self-host alternative, one row per model. Click a column header to sort.
| Model ↕ | Maker ↕ | Access ↕ | $/M in ↕ | $/M out ↕ | Tokens/sec ↕ | TTFT ↕ | Self-host on ↕ | |
|---|---|---|---|---|---|---|---|---|
| Gemini 2.5 Pro | Google DeepMind | api aggregator | $1.25 | $10.0 | — | — | API only | Open → |
| Claude Opus 4.7 | Anthropic | api aggregator | $15.0 | $75.0 | — | — | API only | Open → |
| Claude Sonnet 4.6 | Anthropic | api direct | $3.0 | $15.0 | — | — | API only | Open → |
| Claude 3.5 Sonnet | Anthropic | api direct | $3.0 | $15.0 | — | — | API only | Open → |
| Claude 3.5 Haiku | Anthropic | api direct | $1.0 | $5.0 | — | — | API only | Open → |