Hyperscaler gateways US

Google Vertex AI.

GCP's managed model platform — Gemini, Claude, Llama under Google IAM + the rest of the GCP stack.

Cheapest 12 models

Where the floor is.

Sorted cheapest-first by $/M input. Useful when you're looking for the floor before picking a model.

Loading...

At a glance

Service type
Hyperscaler gateways
Trust tier
Tier 1
Headquarters
US
OpenAI-compat
No
Open weights
Yes
Proprietary
Yes

When to pick Google Vertex AI

Best for

  • Existing AWS / GCP / Azure customers — same IAM, same VPC, same billing.
  • Regulated workloads requiring the hyperscaler's compliance frameworks.
  • Multi-region production deployments tightly coupled to other cloud services.

Avoid for

  • Cost-sensitive workloads — hyperscaler markup over first-party is real.
  • Anyone who doesn't already need the surrounding cloud platform.

Models on Google Vertex AI

Pricing + measured speed + self-host alternative, one row per model. Click a column header to sort.

5 models · 0 benchmarked
Model ↕ Maker ↕ Access ↕ $/M in ↕ $/M out ↕ Tokens/sec ↕ TTFT ↕ Self-host on ↕
Gemini 2.5 Pro Google DeepMind api aggregator $1.25 $10.0 API only Open →
Claude Opus 4.7 Anthropic api aggregator $15.0 $75.0 API only Open →
Claude Sonnet 4.6 Anthropic api direct $3.0 $15.0 API only Open →
Claude 3.5 Sonnet Anthropic api direct $3.0 $15.0 API only Open →
Claude 3.5 Haiku Anthropic api direct $1.0 $5.0 API only Open →