MiMo-V2.5-Pro
Xiaomi
Description
MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.
Release Date
2026-04-22
Parameters
—
Context Length
1.0M
Modalities
text
Capability Radar
40
general
59
coding
87
reasoning
63
scienceest.
70
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 25 | 81.0 | AA |
| General Ranking | 15 | 83.0 | AA |
| Science | 26 | 79.0 | AA |
Benchmark Scores (LLM Stats)
Agents
GDPval-AA
1286.00 / 3000SR
FrontierSWE (Impl.)
340.0%SR
MiMo Coding Bench
73.7%SR
TAU3-Bench
72.9%SR
Terminal-Bench 2.0
68.4%SR
Claw-Eval
64.0%SR
SWE-Bench Pro
57.2%SR
WildClawBench
43.0%SR
Finance Agent v2
41.5%SR
Biology
GPQA
66.7%SR
Code
SWE-Bench Verified
78.9%SR
Finance
MMLU
89.4%SR
MMLU-Pro
68.5%SR
General
ARC-C
97.2%SR
MMLU-Redux
92.8%SR
C-Eval
91.5%SR
CMMLU
90.2%SR
Global-MMLU
83.6%SR
TriviaQA
81.3%SR
MBPP+
74.1%SR
LiveCodeBench v6
39.6%SR
SWE-bench Verified (Agentless)
35.7%SR
Language
BBH
88.4%SR
Winogrande
85.6%SR
Long Context
GraphWalks
62.0%SR
Math
GSM8k
99.6%SR
DROP
86.3%SR
MATH
86.2%SR
AIME
37.3%SR
Humanity's Last Exam
34.0%SR
Reasoning
HellaSwag
89.8%SR
HumanEval+
75.6%SR
AA Evaluation Indices
Coding Index60.2
Intelligence Index42.2
Tau20.9
Gpqa0.9
Ifbench0.8
Lcr0.7
Terminalbench V2 10.7
Scicode0.5
Terminalbench Hard0.4
Hle0.3
Tau Banking0.1
LLM Stats Category Scores
Legal100
Finance100
Agents100
General100
Reasoning50
Language90
Math80
Frontend Development80
Healthcare80
Physics70
Biology70
Chemistry70
Code70
Tool Calling70
Long Context60
Coding60
Vision30
Pricing
Input Price$0.435 / 1M tokens
Output Price$0.87 / 1M tokens
Blended Price (3:1)$0.544 / 1M tokens
Cache Read Price$0.2 / 1M tokens
Speed
Tokens/sec50.5
Time to First Token1.86s
Time to Answer41.44s
Provider Price Ranking
Provider Price Ranking
3 providers
Cheapest: XiaomiMost Expensive: AIHubMix
ProviderInputOutput
1XiaomiPRIMARY
$0.435
$0.87
2routing.run
$0.45
$1.35
3AIHubMix
$1.1
$3.3
Compare pricing across different API providers for this model.