MiMo-V2.5
XiaomiOpen WeightMIT · Commercial OK
Description
MiMo-V2.5 is Xiaomi's native omnimodal sparse Mixture-of-Experts model with 310B total parameters, 15B activated parameters, and a 1M-token context window. Built on the MiMo-V2-Flash backbone, it adds dedicated vision and audio encoders for text, image, video, and audio understanding, and is post-trained with SFT, agentic reinforcement learning, and Multi-Teacher On-Policy Distillation for multimodal perception, long-context reasoning, and agentic workflows.
Release Date
2026-04-22
Parameters
310.8B
Context Length
1.0M
Modalities
audio, image, text, video
Capability Radar
80
general
60
coding
80
reasoning
68
scienceest.
70
agents
80
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 75 | 54.0 | LS |
| Multimodal Ranking | 32 | 81.0 | LS |
Benchmark Scores (LLM Stats)
Agents
MiMo Coding Bench
71.8%SR
Terminal-Bench 2.0
65.8%SR
Claw-Eval
63.2%SR
SWE-Bench Pro
56.1%SR
ResearchClawBench
16.9%SR
Document Understanding
OmniDocBench
87.2%SR
General
MMMU-Pro
77.9%SR
Long Context
GraphWalks
87.0%SR
Multimodal
HR-Bench (4k)
88.5%SR
Video-MME
87.7%SR
DailyOmni
83.5%SR
CharXiv-R
81.0%SR
VideoHolmes
64.0%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Long Context90
Reasoning80
General80
Multimodal80
Vision80
Tool Calling70
Agents60
Code60
Coding60
Pricing
Input Price$0 / 1M tokens
Output Price$0 / 1M tokens
Blended Price (3:1)$0 / 1M tokens
Cache Read Price$0.08 / 1M tokens
Speed
No speed data available
Provider Price Ranking
Provider Price Ranking
6 providers
Cheapest: XiaomiMost Expensive: LLM Gateway
ProviderInputOutput
1XiaomiPRIMARY
$0
$0
2Novita
$0
$0
3DeepInfra
$0
$0
4OpenCode Go
$0.14
$0.28
5Venice AI
$0.175
$0.35
6LLM Gateway
$0.4
$2
Compare pricing across different API providers for this model.