GPU group
Cheap inference.
Mid-range GPUs that run quantized 7B–13B models at the lowest hourly rates.
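As a rough check on which quantized models fit in a given amount of VRAM, the weight footprint can be estimated as parameters times bits per weight divided by eight. The sketch below is a back-of-the-envelope estimate, not a measurement: the function names are hypothetical, and the 20% overhead for KV cache, activations, and runtime buffers is an assumption that varies with context length and inference engine.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights alone, in GB (10^9 bytes)."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def fits_in_vram(params_billion: float, bits_per_weight: float,
                 vram_gb: float, overhead_fraction: float = 0.2) -> bool:
    """Return True if weights plus an assumed overhead fit in vram_gb.

    overhead_fraction is a guess (KV cache, activations, runtime buffers),
    not a figure from the listing above.
    """
    needed = weight_memory_gb(params_billion, bits_per_weight) * (1 + overhead_fraction)
    return needed <= vram_gb


if __name__ == "__main__":
    for params in (7, 13):
        for bits in (4, 8):
            size = weight_memory_gb(params, bits)
            ok = fits_in_vram(params, bits, vram_gb=12)
            print(f"{params}B @ {bits}-bit: ~{size:.1f} GB weights, fits in 12GB: {ok}")
```

Under these assumptions, 4-bit 7B and 13B models leave headroom on a 12GB card, while an 8-bit 13B model does not.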
| GPU | Tier | Architecture | VRAM | From |
|---|---|---|---|---|
| | consumer | | 12GB | $0.019/hr |
| | consumer | | 12GB | $0.020/hr |
| | workstation | | 12GB | $0.022/hr |
| | workstation | | 20GB | $0.025/hr |
| | workstation | | 16GB | $0.026/hr |
| | consumer | | 12GB | $0.055/hr |
| | consumer | | 16GB | $0.056/hr |
| | consumer | | 16GB | $0.059/hr |
| | consumer | | 24GB | $0.063/hr |
| | consumer | | 12GB | $0.071/hr |
| | consumer | | 16GB | $0.071/hr |
| | consumer | | 12GB | $0.080/hr |
| | consumer | | 16GB | $0.093/hr |
| | consumer | | 16GB | $0.11/hr |
| | consumer | | 24GB | $0.12/hr |
| | workstation | | 32GB | $0.14/hr |
| | workstation | | 24GB | $0.15/hr |
| | consumer | | 12GB | $0.16/hr |
| | consumer | | 24GB | $0.17/hr |
| | consumer | | 12GB | $0.17/hr |
| | workstation | | 20GB | $0.17/hr |
| | consumer | | 16GB | $0.17/hr |
| | workstation | | 20GB | $0.19/hr |
| | consumer | | 24GB | $0.24/hr |
| | consumer | | 32GB | $0.29/hr |
| | workstation | | 48GB | $0.31/hr |
| | workstation | | 16GB | $0.32/hr |
| | workstation | | 32GB | $0.44/hr |
| | workstation | | 16GB | $0.46/hr |
| | workstation | | 24GB | $0.46/hr |
| | workstation | | 48GB | $0.56/hr |
| | workstation | | 96GB | $1.53/hr |
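To turn the hourly "From" prices into a budget figure, one option is to pick the cheapest listing that meets a VRAM floor and multiply its rate by the hours of use. The sketch below is illustrative only: it hard-codes a handful of (tier, VRAM, price) rows copied from the table, the helper names are hypothetical, and the 30-day continuous-use month is an assumption rather than a billing rule from the listing.

```python
from typing import NamedTuple, Optional


class Listing(NamedTuple):
    tier: str
    vram_gb: int
    usd_per_hour: float


# A few rows copied from the table above (GPU names are not available here).
LISTINGS = [
    Listing("consumer", 12, 0.019),
    Listing("workstation", 20, 0.025),
    Listing("workstation", 16, 0.026),
    Listing("consumer", 24, 0.063),
    Listing("workstation", 48, 0.31),
    Listing("workstation", 96, 1.53),
]


def cheapest_with_vram(listings: list, min_vram_gb: int) -> Optional[Listing]:
    """Return the lowest-priced listing with at least min_vram_gb, or None."""
    candidates = [l for l in listings if l.vram_gb >= min_vram_gb]
    return min(candidates, key=lambda l: l.usd_per_hour) if candidates else None


def cost_for_hours(usd_per_hour: float, hours: float) -> float:
    """Total cost at a flat hourly rate (ignores storage, egress, minimums)."""
    return usd_per_hour * hours


if __name__ == "__main__":
    hours_in_month = 24 * 30  # assumed: 30 days of continuous use
    for floor in (12, 24, 48):
        pick = cheapest_with_vram(LISTINGS, floor)
        if pick is not None:
            total = cost_for_hours(pick.usd_per_hour, hours_in_month)
            print(f">= {floor}GB: {pick.tier} {pick.vram_gb}GB at "
                  f"${pick.usd_per_hour}/hr, about ${total:.2f} per 30 days")
```

Because "From" is a starting price, actual rates for a given provider and region may be higher than what this estimate assumes.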