by Meta AI

Llama 3.1 405B.

text open weights datacenter 405B params 128K ctx Transformer Quality 76.6
Smallest GPU
1× AMD MI325
Capability snapshot

What it's best at.

Coding 89.0
General knowledge 88.6
Instruction-following 88.6
Math 73.8

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
GPQA 51.1 official ↗
MATH 73.8 official ↗
MMLU 88.6 official ↗
IFEval 88.6 official ↗
MMLU-Pro 73.3 official ↗
HumanEval 89.0 official ↗
Leaderboard standing

Independent rankings.

Artificial Analysis Quality Index
56.0
Composite of reasoning + coding + tool-use benchmarks
View on Artificial Analysis ↗
Description

About Llama 3.1 405B.

Llama 3.1 405B is Meta's largest open-weight model — dense (not MoE), 405B parameters trained on 15T tokens with 30M H100-hours of compute. At launch (July 2024) it was the first open model competitive with GPT-4 and Claude 3.5 Sonnet on most reasoning benchmarks. Mostly superseded by Llama 3.3 70B for production (same quality at 1/6th the inference cost), but still relevant for research and as a teacher model for distillation. Runs on 8× H100 (FP16) or 4× H100 (FP8).

Architecture

How it's built.

Architecture
Transformer
Trained on
15.6T tokens
39 tokens per parameter — below the Chinchilla optimum.
Knowledge cutoff
Dec 2023
235 days from cutoff to release.
Context window

How much it can remember.

128K tokens ≈ 96,000 English words
4K 32K 128K 1M
Max output per call: 4K tokens
Capabilities

What it can do.

· Vision input
· Audio input
· Video input
Function calling
Tool use
JSON mode
Streaming
Fine-tuning