by Mistral AI

Mistral Large 2.

text open weights datacenter 123B params 128K ctx Transformer Quality 89.0
Cheapest input
$2.0/M
on Mistral La Plateforme
Cheapest output
$6.0/M
on Mistral La Plateforme
Fastest
26 tok/s
on OpenRouter
Smallest GPU
1× Nvidia RTX PRO 6000 S
Capability snapshot

What it's best at.

Coding 92.0
General knowledge 84.0
Math 73.0

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
MATH 73.0 official ↗
MMLU 84.0 official ↗
HumanEval 92.0 official ↗
Leaderboard standing

Independent rankings.

Artificial Analysis Quality Index
53.0
Composite of reasoning + coding + tool-use benchmarks
View on Artificial Analysis ↗
Description

About Mistral Large 2.

Mistral Large 2 is Mistral AI's flagship open-weight model (123B dense parameters), released July 2024 under the Mistral Research License — free for research and non-commercial use, paid commercial license required for production. Strong on coding and multilingual tasks; competitive with Llama 3.1 405B at 1/3 the inference cost. 128K context. Available via Mistral's own API (la Plateforme), AWS Bedrock, Azure AI, and self-hosted via vLLM.

Architecture

How it's built.

Architecture
Transformer
Knowledge cutoff
Apr 2024
114 days from cutoff to release.
Context window

How much it can remember.

128K tokens ≈ 96,000 English words
4K 32K 128K 1M
Max output per call: 4K tokens
Capabilities

What it can do.

· Vision input
· Audio input
· Video input
Function calling
Tool use
JSON mode
Streaming
Fine-tuning