Skip to main content

MiMo-V2.5-Pro (Non-reasoning)

Xiaomi

Description

MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.

Release Date
2026-04-22
Parameters
Context Length
1.0M
Modalities
text

Capability Radar

25
general
39
coding
76
reasoning
51
scienceest.
70
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking175
52.0
AA
General Ranking182
52.0
AA
Science134
56.0
AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA1286.00 / 3000SR
FrontierSWE (Impl.)340.0%SR
MiMo Coding Bench73.7%SR
TAU3-Bench72.9%SR
Terminal-Bench 2.068.4%SR
Claw-Eval64.0%SR
SWE-Bench Pro57.2%SR
WildClawBench43.0%SR
Finance Agent v241.5%SR

Biology

GPQA66.7%SR

Code

SWE-Bench Verified78.9%SR

Finance

MMLU89.4%SR
MMLU-Pro68.5%SR

General

ARC-C97.2%SR
MMLU-Redux92.8%SR
C-Eval91.5%SR
CMMLU90.2%SR
Global-MMLU83.6%SR
TriviaQA81.3%SR
MBPP+74.1%SR
LiveCodeBench v639.6%SR
SWE-bench Verified (Agentless)35.7%SR

Language

BBH88.4%SR
Winogrande85.6%SR

Long Context

GraphWalks62.0%SR

Math

GSM8k99.6%SR
DROP86.3%SR
MATH86.2%SR
AIME37.3%SR
Humanity's Last Exam34.0%SR

Reasoning

HellaSwag89.8%SR
HumanEval+75.6%SR

AA Evaluation Indices

Intelligence Index
27.9
Gpqa
0.8
Tau2
0.7
Ifbench
0.4
Scicode
0.4
Terminalbench Hard
0.4
Lcr
0.3
Hle
0.1

LLM Stats Category Scores

Legal
100
Finance
100
Agents
100
General
100
Reasoning
50
Language
90
Math
80
Frontend Development
80
Healthcare
80
Physics
70
Biology
70
Chemistry
70
Code
70
Tool Calling
70
Long Context
60
Coding
60
Vision
30

Pricing

Input Price$0.9 / 1M tokens
Output Price$2.7 / 1M tokens
Blended Price (3:1)$1.35 / 1M tokens
Cache Read Price$0.2 / 1M tokens

Speed

Tokens/sec53.5
Time to First Token1.67s
Time to Answer1.67s

Provider Price Ranking

Provider Price Ranking

9 providers

Cheapest: NanoGPTMost Expensive: NovitaAI
ProviderInputOutput
1NanoGPTCheapest
$0.435
$0.87
2OpenRouter
$0.435
$0.87
3Vercel AI Gateway
$0.435
$0.87
4routing.run
$0.45
$1.35
5XiaomiPRIMARY
$0.9
$2.7
6ZenMux
$1
$3
7Deep Infra
$1
$3
8Kilo Gateway
$1
$3
9NovitaAI
$2
$6

Compare pricing across different API providers for this model.

External Sources