GPU group

Cheap inference.

Mid-range GPUs that run quantized 7B–13B models at the lowest hourly rates.

VRAM   From
12GB   $0.019/hr
12GB   $0.020/hr
12GB   $0.022/hr
20GB   $0.025/hr
16GB   $0.026/hr
16GB   $0.051/hr
12GB   $0.055/hr
16GB   $0.059/hr
16GB   $0.062/hr
24GB   $0.063/hr
12GB   $0.071/hr
12GB   $0.080/hr
16GB   $0.093/hr
16GB   $0.11/hr
24GB   $0.12/hr
12GB   $0.13/hr
32GB   $0.14/hr
24GB   $0.15/hr
12GB   $0.16/hr
24GB   $0.17/hr
20GB   $0.17/hr
16GB   $0.17/hr
20GB   $0.19/hr
24GB   $0.24/hr
32GB   $0.29/hr
48GB   $0.31/hr
16GB   $0.32/hr
32GB   $0.44/hr
24GB   $0.46/hr
16GB   $0.46/hr
48GB   $0.56/hr
96GB   $1.53/hr
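
As a rough guide to why the 12GB to 24GB tiers listed above suit quantized 7B–13B models, the sketch below estimates the size of the model weights at a given bit width and checks it against a VRAM tier. It is a back-of-the-envelope calculation, not a benchmark: the function names `weight_gib` and `fits` are hypothetical, the 2 GiB allowance for KV cache and runtime overhead is an assumed placeholder, and the listed VRAM figures are treated as GiB.

```python
# Rough sizing sketch. All constants are illustrative assumptions,
# not measurements of any particular GPU or inference runtime.

GIB = 2**30  # bytes per GiB


def weight_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / GIB


def fits(params_billion: float, bits_per_weight: float,
         vram_gib: float, overhead_gib: float = 2.0) -> bool:
    """True if weights plus an assumed fixed overhead fit in the given VRAM.

    overhead_gib is a placeholder for KV cache, activations, and runtime
    buffers; real usage depends on context length and batch size.
    """
    return weight_gib(params_billion, bits_per_weight) + overhead_gib <= vram_gib


if __name__ == "__main__":
    for params in (7, 13):
        for bits in (4, 8):
            size = weight_gib(params, bits)
            tiers = [v for v in (12, 16, 20, 24) if fits(params, bits, v)]
            print(f"{params}B @ {bits}-bit: ~{size:.1f} GiB weights, fits tiers {tiers}")
```

Under these assumptions a 13B model at 4 bits needs roughly 6 GiB for weights and fits a 12GB card, while the same model at 8 bits needs about 12 GiB for weights alone and calls for a 16GB tier or larger. Longer contexts or larger batches raise the overhead and can shift a model up a tier.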