MiMo-V2.5-Pro (Non-reasoning)
Xiaomi
Description
MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.
Release Date
2026-04-22
Parameters
1.02T total (42B active)
Context Length
1.0M
Modalities
text
Capability Radar
| Capability | Score |
|---|---|
| General | 25 |
| Coding | 39 |
| Reasoning | 76 |
| Science (est.) | 51 |
| Agents | 70 |
| Multimodal | 0 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 175 | 52.0 | AA |
| General Ranking | 182 | 52.0 | AA |
| Science | 134 | 56.0 | AA |
Benchmark Scores (LLM Stats)
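| Category | Benchmark | Score |
|---|---|---|
| Agents | GDPval-AA | 1286.00 / 3000 SR |
| Agents | FrontierSWE (Impl.) | 340.0% SR |
| Agents | MiMo Coding Bench | 73.7% SR |
| Agents | TAU3-Bench | 72.9% SR |
| Agents | Terminal-Bench 2.0 | 68.4% SR |
| Agents | Claw-Eval | 64.0% SR |
| Agents | SWE-Bench Pro | 57.2% SR |
| Agents | WildClawBench | 43.0% SR |
| Agents | Finance Agent v2 | 41.5% SR |
| Biology | GPQA | 66.7% SR |
| Code | SWE-Bench Verified | 78.9% SR |
| Finance | MMLU | 89.4% SR |
| Finance | MMLU-Pro | 68.5% SR |
| General | ARC-C | 97.2% SR |
| General | MMLU-Redux | 92.8% SR |
| General | C-Eval | 91.5% SR |
| General | CMMLU | 90.2% SR |
| General | Global-MMLU | 83.6% SR |
| General | TriviaQA | 81.3% SR |
| General | MBPP+ | 74.1% SR |
| General | LiveCodeBench v6 | 39.6% SR |
| General | SWE-bench Verified (Agentless) | 35.7% SR |
| Language | BBH | 88.4% SR |
| Language | Winogrande | 85.6% SR |
| Long Context | GraphWalks | 62.0% SR |
| Math | GSM8k | 99.6% SR |
| Math | DROP | 86.3% SR |
| Math | MATH | 86.2% SR |
| Math | AIME | 37.3% SR |
| Math | Humanity's Last Exam | 34.0% SR |
| Reasoning | HellaSwag | 89.8% SR |
| Reasoning | HumanEval+ | 75.6% SR |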
AA Evaluation Indices
| Index | Score |
|---|---|
| Intelligence Index | 27.9 |
| GPQA | 0.8 |
| Tau2 | 0.7 |
| IFBench | 0.4 |
| SciCode | 0.4 |
| Terminal-Bench Hard | 0.4 |
| LCR | 0.3 |
| HLE | 0.1 |
LLM Stats Category Scores
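| Category | Score |
|---|---|
| Legal | 100 |
| Finance | 100 |
| Agents | 100 |
| General | 100 |
| Reasoning | 50 |
| Language | 90 |
| Math | 80 |
| Frontend Development | 80 |
| Healthcare | 80 |
| Physics | 70 |
| Biology | 70 |
| Chemistry | 70 |
| Code | 70 |
| Tool Calling | 70 |
| Long Context | 60 |
| Coding | 60 |
| Vision | 30 |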
Pricing
| Item | Price (USD per 1M tokens) |
|---|---|
| Input Price | $0.90 |
| Output Price | $2.70 |
| Blended Price (3:1) | $1.35 |
| Cache Read Price | $0.20 |
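The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output ratio. A minimal sketch of that arithmetic follows; the function name `blended_price` and the reading of "3:1" as input tokens to output tokens are assumptions, not something the page states.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices, in USD per 1M tokens.

    Assumption: "3:1" means three input tokens for every output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


if __name__ == "__main__":
    # Listed prices: $0.9 input and $2.7 output per 1M tokens.
    print(f"${blended_price(0.9, 2.7):.2f} / 1M tokens")  # prints "$1.35 / 1M tokens"
```

Under this assumption the result matches the listed blended price of $1.35 per 1M tokens.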
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 53.5 |
| Time to First Token | 1.67s |
| Time to Answer | 1.67s |
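These two figures can be combined into a rough estimate of how long a full response takes. The sketch below assumes generation proceeds at a constant 53.5 tokens/sec after the first token arrives, which real deployments may not hold to; the function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 53.5,
                               time_to_first_token: float = 1.67) -> float:
    """Rough wall-clock estimate: first-token latency plus steady-state generation.

    Assumption: throughput is constant after the first token.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return time_to_first_token + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n} tokens: about {estimated_response_seconds(n):.1f}s")
```

The estimate ignores network variance, queuing, and provider differences, so it is best read as a lower-bound planning figure.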
Provider Price Ranking
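9 providers. Cheapest: NanoGPT. Most expensive: NovitaAI.

| # | Provider | Input (per 1M tokens) | Output (per 1M tokens) |
|---|---|---|---|
| 1 | NanoGPT (cheapest) | $0.435 | $0.87 |
| 2 | OpenRouter | $0.435 | $0.87 |
| 3 | Vercel AI Gateway | $0.435 | $0.87 |
| 4 | routing.run | $0.45 | $1.35 |
| 5 | Xiaomi (primary) | $0.9 | $2.7 |
| 6 | ZenMux | $1 | $3 |
| 7 | Deep Infra | $1 | $3 |
| 8 | Kilo Gateway | $1 | $3 |
| 9 | NovitaAI | $2 | $6 |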
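To see what the provider spread means for a concrete job, the listed prices can be applied to a token budget. This is a sketch only: it assumes the provider prices are in USD per 1M tokens (the Xiaomi row matches the per-1M pricing above), and the names `PROVIDER_PRICES`, `workload_cost`, and `rank_providers` are hypothetical.

```python
# (input price, output price), assumed to be USD per 1M tokens, copied from the ranking.
PROVIDER_PRICES = {
    "NanoGPT": (0.435, 0.87),
    "OpenRouter": (0.435, 0.87),
    "Vercel AI Gateway": (0.435, 0.87),
    "routing.run": (0.45, 1.35),
    "Xiaomi": (0.9, 2.7),
    "ZenMux": (1.0, 3.0),
    "Deep Infra": (1.0, 3.0),
    "Kilo Gateway": (1.0, 3.0),
    "NovitaAI": (2.0, 6.0),
}


def workload_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Cost in USD of one workload at the given per-1M-token prices."""
    return (input_tokens / 1_000_000) * input_price + (output_tokens / 1_000_000) * output_price


def rank_providers(input_tokens: int, output_tokens: int, prices=PROVIDER_PRICES):
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: workload_cost(input_tokens, output_tokens, inp, out)
        for name, (inp, out) in prices.items()
    }
    # sorted() is stable, so providers with equal cost keep their listed order.
    return sorted(costs.items(), key=lambda item: item[1])


if __name__ == "__main__":
    # Example workload: 2M input tokens and 0.5M output tokens.
    for name, cost in rank_providers(2_000_000, 500_000):
        print(f"{name:<18} ${cost:.2f}")
```

The comparison covers list prices only; cache-read discounts, rate limits, and latency differences between providers are not modeled.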