by Moonshot AI

Kimi K2.

text open weights datacenter 1000B params 32B active 256K ctx MoE Quality 78.5
Cheapest input
$0.57/M
on Moonshot AI Platform
Cheapest output
$2.3/M
on Moonshot AI Platform
Fastest
14 tok/s
on OpenRouter
Smallest GPU
4× AMD MI300
Capability snapshot

What it's best at.

Reasoning 78.5
Code repair 65.8

Scores normalised against benchmark ceilings (100 = perfect). Coloured by tier — coral 80+ frontier, lavender 65+ strong, sage 50+ solid, slate below.

Benchmarks

Published scores.

Benchmark Score Source
MMLU-Pro 78.5 official ↗
SWE-bench 65.8 official ↗
Description

About Kimi K2.

Kimi K2 is Moonshot AI's frontier open-weight model — 1T parameters total (Mixture-of-Experts) with 32B activated per token. Trained primarily for agentic tasks; tops several SWE-bench-style coding benchmarks at the open-weight tier. The K2 release made waves in mid-2025 by matching closed-frontier models on coding and tool use while remaining fully open. Available on Hugging Face under a modified MIT license. Cost-competitive with Claude Sonnet on Moonshot's own API. 256K context.

Architecture

How it's built.

Architecture
MoE
Mixture of Experts — 32B params active per token out of 1000B total.
Knowledge cutoff
Dec 2024
222 days from cutoff to release.
Context window

How much it can remember.

256K tokens ≈ 192,000 English words
4K 32K 128K 1M
Max output per call: 33K tokens
Capabilities

What it can do.

· Vision input
· Audio input
· Video input
Function calling
Tool use
JSON mode
Streaming
Fine-tuning
All access providers

Every place this model is hosted.