First-party clouds · United States · Live data

DeepInfra.

First-party AI cloud. Besides its hosted-inference API, DeepInfra rents B200 GPU instances on demand in 1x, 2x, 4x, and 8x configurations. It is the same provider, just a different surface.

At a glance

Business model: First-party clouds
Tier: First-party cloud
Trust tier: Tier 1
Headquarters: United States
Payout: N/A (first-party)

When to pick DeepInfra

Best for

Distributed training where InfiniBand interconnect matters.
Regulated workloads requiring SOC 2 / HIPAA.
Teams that need real human support and SLA-backed uptime.

Avoid for

One-off short workloads — the per-hour premium adds up.
Hobbyist single-GPU jobs that don't need premium support.

Price history

Daily median on DeepInfra's top GPUs.

GPUs available on DeepInfra

GPU	Tier	VRAM	$/hr	Range	Offers
Nvidia B200	datacenter	192 GB	$3.69/hr	$3.69/hr–$3.69/hr	4

Compare with peers

Other providers in the same bucket — quick way to sanity-check pricing before committing.

Novita AI

Tier 2

Catalog-only — no live pricing yet

AI and agent cloud platform providing on-demand GPU rentals for training and inference workloads with high performance and cost efficiency.

AceCloud

Catalog-only — no live pricing yet

Hyperstack

Catalog-only — no live pricing yet

Lambda

Catalog-only — no live pricing yet

Frequently asked

How is DeepInfra billing measured?

Most providers bill per-second once the instance is running, with a small minimum (often 60 seconds). Some first-party clouds round up to the minute. Headline $/hr is the right comparison unit.
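
To make the two billing styles above concrete, here is a minimal Python sketch that estimates a charge under per-second billing with a 60-second minimum and under round-up-to-the-minute billing. The function names are hypothetical, and using the $3.69/hr B200 list price as the input rate is an illustrative assumption; this is not DeepInfra's documented billing logic, so check the provider's billing docs for the actual rules.

```python
import math

def per_second_cost(runtime_s: float, rate_per_hr: float, minimum_s: int = 60) -> float:
    """Charge under per-second billing with a minimum billable duration.

    minimum_s=60 mirrors the 'often 60 seconds' minimum; it is an assumption,
    not a confirmed DeepInfra value.
    """
    billable_s = max(math.ceil(runtime_s), minimum_s)
    return billable_s * rate_per_hr / 3600

def per_minute_cost(runtime_s: float, rate_per_hr: float) -> float:
    """Charge when runtime is rounded up to the next whole minute."""
    billable_min = max(math.ceil(runtime_s / 60), 1)
    return billable_min * rate_per_hr / 60

if __name__ == "__main__":
    rate = 3.69  # $/hr, the B200 headline price used as an example input
    for runtime in (20, 90, 3600):
        print(
            f"{runtime:>5}s  per-second: ${per_second_cost(runtime, rate):.4f}  "
            f"per-minute: ${per_minute_cost(runtime, rate):.4f}"
        )
```

For a full hour both schemes converge on the headline rate, which is why $/hr remains the sensible comparison unit; the schemes differ only for very short or odd-length runs.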

Which regions does DeepInfra offer GPUs in?

P2P marketplaces aggregate hosts worldwide, so region varies per offer. First-party clouds and hyperscalers expose explicit region pickers (US, EU, APAC). If region matters, filter on the provider's site after clicking through.

Does DeepInfra offer an SLA?

Hyperscalers (AWS, GCP, Azure, Oracle) publish formal SLAs. First-party clouds (Lambda, CoreWeave) offer support contracts. P2P marketplaces and decentralized networks have no SLA — uptime depends on the individual host.

What's the refund / cancellation policy on DeepInfra?

Per-second billing means you only pay for compute used — stop the instance and billing stops. Pre-paid credits and committed-use discounts have provider-specific terms; check the provider's billing docs before pre-paying.