First-party clouds
United States
· Live data
DeepInfra.
First-party AI cloud — DeepInfra also rents B200 GPU instances on demand (1x/2x/4x/8x configs). Same provider as the hosted-inference API, just a different surface.
At a glance
- Business model
- First-party clouds
- Tier
- First-party cloud
- Trust tier
- Tier 1
- Headquarters
- United States
- Payout
- N/A (first-party)
When to pick DeepInfra
Best for
- Distributed training where InfiniBand interconnect matters.
- Regulated workloads requiring SOC 2 / HIPAA.
- Teams that need real human support and SLA-backed uptime.
Avoid for
- One-off short workloads — the per-hour premium adds up.
- Hobbyist single-GPU jobs that don't need premium support.
Price history
Daily median on DeepInfra's top GPUs.
Loading...
GPUs available on DeepInfra
| GPU | Tier | $/hr | |
|---|---|---|---|
| datacenter | $2.79/hr | Compare → |
Compare with peers
Other providers in the same bucket — quick way to sanity-check pricing before committing.
Cheapest: $0.18/hr
· Nvidia RTX 4090
· 3 GPUs
AI and agent cloud platform providing on-demand GPU rentals for training and inference workloads with high performance and cost efficiency.
Cheapest: $0.85/hr
· Nvidia Tesla V100
· 3 GPUs
Cheapest: $0.32/hr
· Nvidia Nvidia RTX A4000
· 6 GPUs
Cheapest: $0.75/hr
· Nvidia A10
· 7 GPUs
Frequently asked
How is DeepInfra billing measured?
Most providers bill per-second once the instance is running, with a small minimum (often 60 seconds). Some first-party clouds round up to the minute. Headline $/hr is the right comparison unit.
Which regions does DeepInfra offer GPUs in?
P2P marketplaces aggregate hosts worldwide so region varies per offer. First-party clouds and hyperscalers expose explicit region pickers (US, EU, APAC). Filter on the provider's site after clicking through if region matters.
Does DeepInfra offer an SLA?
Hyperscalers (AWS, GCP, Azure, Oracle) publish formal SLAs. First-party clouds (Lambda, CoreWeave) offer support contracts. P2P marketplaces and decentralized networks have no SLA — uptime depends on the individual host.
What's the refund / cancellation policy on DeepInfra?
Per-second billing means you only pay for compute used — stop the instance and billing stops. Pre-paid credits and committed-use discounts have provider-specific terms; check the provider's billing docs before pre-paying.