MiniMax M2.5 is built for state-of-the-art coding, agentic tool use, search, and office work, extensively trained with reinforcement learning across hundreds of thousands of real-world environments to plan like an archit
Where to use it
Cheapest hosted endpoints.
FAQ
Frequently asked.
How do I run MiniMax-M2.5?
MiniMax-M2.5 is open-weight, so you can self-host it on rented GPUs. See the Run It Yourself tab for GPU configurations and cost estimates, or use one of the hosted inference providers listed on this page.
Where can I access MiniMax-M2.5?
MiniMax-M2.5 is available via MiniMax Platform and Fireworks AI. Each access option lists its own pricing (per million tokens or hourly hosting).
How much does it cost to run MiniMax-M2.5?
Self-hosting cost depends on the GPU you rent and the throughput you need. See the Run It Yourself tab for GPU configurations and hourly cost estimates.
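As a rough illustration of how GPU price and throughput combine, the sketch below converts an hourly rental rate into a cost per million tokens. The function name, the GPU count, the hourly price, and the throughput figure are all hypothetical placeholders, not measured values for MiniMax-M2.5.

```python
def cost_per_million_tokens(gpu_hourly_usd: float, num_gpus: int,
                            tokens_per_second: float) -> float:
    """Return USD per one million tokens for a self-hosted deployment.

    tokens_per_second is the aggregate throughput of the whole deployment.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    hourly_cost = gpu_hourly_usd * num_gpus
    return hourly_cost / tokens_per_hour * 1_000_000


# Hypothetical inputs: 8 GPUs at $2.00/hour each, 1,000 tokens/s in total.
example_cost = cost_per_million_tokens(2.0, 8, 1000.0)
print(f"${example_cost:.2f} per million tokens")
```

Comparing this figure with a hosted provider's per-million-token price gives a first-order sense of when self-hosting might pay off, assuming the GPUs stay busy.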
Is MiniMax-M2.5 open-source or proprietary?
MiniMax-M2.5 is open-weight, so you can download and self-host it under the terms of its license.
Context window
How much it can remember.
197K tokens
≈ 147,456 English words
Scale markers: 4K, 32K, 128K, 1M
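The word estimate above is consistent with a ratio of about 0.75 English words per token applied to a 196,608-token window (192 × 1024, shown rounded as 197K). Both the ratio and the exact token count are assumptions inferred from the displayed numbers, not documented figures. A minimal sketch of the arithmetic:

```python
# Assumption: roughly 0.75 English words per token, a common rule of thumb.
WORDS_PER_TOKEN = 0.75


def tokens_to_words(tokens: int, words_per_token: float = WORDS_PER_TOKEN) -> int:
    """Estimate how many English words fit in a given number of tokens."""
    return round(tokens * words_per_token)


# Assumption: the "197K" label is a rounded 192 * 1024 = 196,608 tokens.
context_tokens = 192 * 1024
estimated_words = tokens_to_words(context_tokens)
print(estimated_words)
```

Actual word counts vary with the tokenizer and the text, so this should be read as an order-of-magnitude guide.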
Capabilities
What it can do.
· Vision input
· Audio input
· Video input
· Function calling
· Tool use
· JSON mode
✓ Streaming
· Fine-tuning