Tool · Run a model

Will my model fit on this GPU?

Compute the VRAM you need for a given AI model at a chosen precision and context window, and see exactly which rentable GPUs have the headroom — plus the cheapest provider for each.

Active params

70.0B

Dense

Bytes / param

2.0

FP16

Context

128,000

tokens

VRAM required

179 GB

weights + KV + 20% headroom

GPUs that fit

Sorted by VRAM ascending — smallest fitting card first (usually the cheapest).

GPU	VRAM	FP16 TFLOPS	TDP	Cheapest provider	$/hr
AMD AMD MI300X	192GB	—	—	no live offers	—	Open →
AMD MI300	192GB	—	750W	no live offers	—	Open →
Nvidia Nvidia B200	192GB	—	—	DeepInfra	$3.69/hr	Open →
AMD MI325	256GB	—	1000W	no live offers	—	Open →
Nvidia GB300	288GB	—	2700W	no live offers	—	Open →
Nvidia Nvidia B300	288GB	—	—	io.net	$5.38/hr	Open →
AMD MI355X	288GB	—	1400W	no live offers	—	Open →

Related tools

Self-host vs API breakeven →
API monthly bill →
GPU side-by-side →