Skip to main content

MiMo-V2.5-Pro

Xiaomi

Description

MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.

Release Date
2026-04-22
Parameters
Context Length
1.0M
Modalities
text

Capability Radar

40
general
59
coding
87
reasoning
63
scienceest.
70
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking25
81.0
AA
General Ranking15
83.0
AA
Science26
79.0
AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA1286.00 / 3000SR
FrontierSWE (Impl.)340.0%SR
MiMo Coding Bench73.7%SR
TAU3-Bench72.9%SR
Terminal-Bench 2.068.4%SR
Claw-Eval64.0%SR
SWE-Bench Pro57.2%SR
WildClawBench43.0%SR
Finance Agent v241.5%SR

Biology

GPQA66.7%SR

Code

SWE-Bench Verified78.9%SR

Finance

MMLU89.4%SR
MMLU-Pro68.5%SR

General

ARC-C97.2%SR
MMLU-Redux92.8%SR
C-Eval91.5%SR
CMMLU90.2%SR
Global-MMLU83.6%SR
TriviaQA81.3%SR
MBPP+74.1%SR
LiveCodeBench v639.6%SR
SWE-bench Verified (Agentless)35.7%SR

Language

BBH88.4%SR
Winogrande85.6%SR

Long Context

GraphWalks62.0%SR

Math

GSM8k99.6%SR
DROP86.3%SR
MATH86.2%SR
AIME37.3%SR
Humanity's Last Exam34.0%SR

Reasoning

HellaSwag89.8%SR
HumanEval+75.6%SR

AA Evaluation Indices

Coding Index
60.2
Intelligence Index
42.2
Tau2
0.9
Gpqa
0.9
Ifbench
0.8
Lcr
0.7
Terminalbench V2 1
0.7
Scicode
0.5
Terminalbench Hard
0.4
Hle
0.3
Tau Banking
0.1

LLM Stats Category Scores

Legal
100
Finance
100
Agents
100
General
100
Reasoning
50
Language
90
Math
80
Frontend Development
80
Healthcare
80
Physics
70
Biology
70
Chemistry
70
Code
70
Tool Calling
70
Long Context
60
Coding
60
Vision
30

Pricing

Input Price$0.435 / 1M tokens
Output Price$0.87 / 1M tokens
Blended Price (3:1)$0.544 / 1M tokens
Cache Read Price$0.2 / 1M tokens

Speed

Tokens/sec50.5
Time to First Token1.86s
Time to Answer41.44s

Provider Price Ranking

Provider Price Ranking

3 providers

Cheapest: XiaomiMost Expensive: AIHubMix
ProviderInputOutput
1XiaomiPRIMARY
$0.435
$0.87
2routing.run
$0.45
$1.35
3AIHubMix
$1.1
$3.3

Compare pricing across different API providers for this model.

External Sources