First-party APIs OpenAI-compatible CN

Moonshot AI Platform.

Moonshot AI's developer API for the Kimi family. Both open-weight (K2 series) and closed flagship (Moonshot v1) accessible from one endpoint.

Cheapest 12 models

Where the floor is.

Sorted cheapest-first by $/M input. Useful when you're looking for the floor before picking a model.

Loading...

At a glance

Service type
First-party APIs
Trust tier
Tier 1
Headquarters
CN
OpenAI-compat
Yes
Open weights
Yes
Proprietary
Yes

When to pick Moonshot AI Platform

Best for

  • Full feature coverage — prompt caching, batch tier, function calling, fine-tuning.
  • The lowest per-token rate for the maker's own models.
  • Production workloads where a direct billing relationship matters.

Avoid for

  • Multi-model workflows that need a unified billing surface.
  • Anywhere the maker's own SLA isn't sufficient.

Models on Moonshot AI Platform

Pricing + measured speed + self-host alternative, one row per model. Click a column header to sort.

7 models · 0 benchmarked
Model ↕ Maker ↕ Access ↕ $/M in ↕ $/M out ↕ Tokens/sec ↕ TTFT ↕ Self-host on ↕
Kimi K2.5 Moonshot AI api direct API only Open →
Kimi K2.6 Moonshot AI api direct API only Open →
MoonshotAI: Kimi K2 0905 Moonshot AI api direct $0.6 $2.5 4× AMD MI300 · INT4 Open →
Kimi K2 Thinking Moonshot AI api direct $0.6 $2.5 4× AMD MI300 · INT4 Open →
Kimi K2 Moonshot AI api direct $0.57 $2.3 4× Nvidia H200 · FP8 Open →
Kimi K2.5 Moonshot AI api direct $0.6 $2.5 4× AMD MI300 · INT4 Open →
Kimi K2.6 Moonshot AI api direct $0.6 $2.5 4× AMD MI300 · INT4 Open →