DeepInfra

First-party AI cloud. DeepInfra also rents B200 GPU instances on demand in 1x, 2x, 4x, and 8x configurations. It is the same provider as the hosted-inference API, offered through a different surface.

At a glance

Business model: First-party clouds
Tier: First-party cloud
Trust tier: Tier 1
Headquarters: United States
Payout: N/A (first-party)

When to pick DeepInfra

Best for

  • Distributed training where InfiniBand interconnect matters.
  • Regulated workloads requiring SOC 2 / HIPAA.
  • Teams that need real human support and SLA-backed uptime.

Avoid for

  • One-off short workloads — the per-hour premium adds up.
  • Hobbyist single-GPU jobs that don't need premium support.

Price history

Daily median on DeepInfra's top GPUs.

GPUs available on DeepInfra

GPU $/hr
$2.79/hr

Compare with peers

Other providers in the same bucket — quick way to sanity-check pricing before committing.

Frequently asked

How is DeepInfra billing measured?
Most providers bill per-second once the instance is running, with a small minimum (often 60 seconds). Some first-party clouds round up to the minute. Headline $/hr is the right comparison unit.
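
The billing arithmetic described above can be sketched in a few lines. This is an illustration only: the function names, the 60-second default minimum, and the rounding increment are assumptions made for the example, not DeepInfra's documented billing rules. The $2.79/hr rate is the one listed on this page.

```python
import math


def billed_seconds(runtime_s: float, minimum_s: int = 60, round_to_s: int = 1) -> int:
    """Seconds charged for one run.

    minimum_s and round_to_s are hypothetical knobs: a 60-second minimum
    with per-second billing (round_to_s=1), or rounding up to the minute
    (round_to_s=60). Check the provider's billing docs for real values.
    """
    if runtime_s <= 0:
        return 0  # assumption: an instance that never ran is not charged
    charged = max(runtime_s, minimum_s)  # apply the minimum charge
    # Round up to the next whole billing increment.
    return math.ceil(charged / round_to_s) * round_to_s


def cost_usd(runtime_s: float, hourly_rate_usd: float,
             minimum_s: int = 60, round_to_s: int = 1) -> float:
    """Cost of one run, converting the headline $/hr rate to a per-second rate."""
    return billed_seconds(runtime_s, minimum_s, round_to_s) * hourly_rate_usd / 3600


if __name__ == "__main__":
    rate = 2.79  # $/hr, as listed on this page
    print(f"45 s, per-second billing: ${cost_usd(45, rate):.5f}")
    print(f"90 s, per-second billing: ${cost_usd(90, rate):.5f}")
    print(f"90 s, per-minute rounding: ${cost_usd(90, rate, round_to_s=60):.5f}")
```

For runs of an hour or more the minimum and the rounding increment barely move the total, which is why the headline $/hr remains the fair unit for comparing providers.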

Which regions does DeepInfra offer GPUs in?
P2P marketplaces aggregate hosts worldwide, so the region varies per offer. First-party clouds and hyperscalers expose explicit region pickers (US, EU, APAC). If region matters, filter on the provider's site after clicking through.

Does DeepInfra offer an SLA?
Hyperscalers (AWS, GCP, Azure, Oracle) publish formal SLAs. First-party clouds (Lambda, CoreWeave) offer support contracts. P2P marketplaces and decentralized networks have no SLA; uptime depends on the individual host.

What's the refund / cancellation policy on DeepInfra?
Per-second billing means you pay only for the compute you use: stop the instance and billing stops. Pre-paid credits and committed-use discounts have provider-specific terms; check the provider's billing docs before pre-paying.