Trouver un GPU →

Use case

Fine-tune LLMs (LoRA / QLoRA).

LoRA / QLoRA cuts VRAM requirements by an order of magnitude vs. full fine-tuning. A single 80GB datacenter GPU or 2× 24GB consumer cards can fine-tune a 70B model in a few hours.

≥ 24GB VRAM Datacenter tier

Best GPUs

Top GPUs for this workload.

Ranked by suitability — higher fitness scores mean the card handles this workload more comfortably.

GPU	Tier	VRAM	Fit
Nvidia Nvidia B200	datacenter	192GB	100	Compare →
Nvidia Nvidia H100	datacenter	80GB	95	Compare →
Nvidia Nvidia H200	datacenter	141GB	95	Compare →
Nvidia Nvidia A100	datacenter	40GB	85	Compare →
Nvidia Nvidia RTX 6000 Ada	workstation	48GB	75	Compare →
Nvidia Nvidia L40S	datacenter	48GB	70	Compare →
Nvidia Nvidia RTX A6000	workstation	48GB	70	Compare →

Best AI models

Top models for this workload.

Llama 3.1 70B

by Meta AI · Llama · 128,000 ctx

Llama 3.1 70B — production workhorse, superseded by 3.3 but still widely deployed.

Llama 3.1 8B

by Meta AI · Llama · 128,000 ctx

Meta's most popular open-weight small LLM — fits anywhere.

Llama 3.3 70B

by Meta AI · Llama · 128,000 ctx

Meta's best-in-class open-weight LLM — 70B class.

Mistral Nemo 12B

by Mistral AI · Mistral · 128,000 ctx

Mistral × Nvidia collab — 12B Apache-licensed, multilingual.

Qwen 2.5 32B

by Alibaba (Qwen Team) · Qwen · 128,000 ctx

32B Qwen 2.5 — laptop-class workhorse.

Qwen 2.5 14B

by Alibaba (Qwen Team) · Qwen · 128,000 ctx

14B Qwen 2.5 — sweet spot for single-GPU local hosting.

Qwen 2.5 7B

by Alibaba (Qwen Team) · Qwen · 128,000 ctx

7B Qwen 2.5 — most popular Qwen variant on Ollama.

Mistral 7B v0.3

by Mistral AI · Mistral 7B · 32,768 ctx

The current Mistral 7B — adds function calling + extended vocab.

Gemma 2 27B

by Google DeepMind · Gemma 2 · 8,192 ctx

Google's pre-Gemma-3 open-weight workhorse.

Qwen 2.5 72B

by Alibaba (Qwen Team) · Qwen · 128,000 ctx

Alibaba's flagship open-weight LLM — 72B dense.

Mixtral 8x22B

by Mistral AI · Mixtral · 65,536 ctx

Mistral's open MoE — 141B total, 39B active.

Gemma 3 27B

by Google DeepMind · Gemma · 128,000 ctx

Google's open-weight multimodal LLM — efficient and license-permissive.