MiMo-V2.5-Pro
Xiaomi
Description
MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.
Release date
2026-04-22
Parameter count
1.02T total, 42B active
Context length
1.0M tokens
Supported modalities
text
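Capability radar chart
general: 40
coding: 59
reasoning: 87
science (estimated): 63
agents: 70
multimodal: 0
The science score is estimated from reasoning ability when no dedicated science evaluation is available.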
Leaderboard rankings
Benchmark scores (LLM Stats)
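Agents
GDPval-AA: 1286.00 / 3000 (self-reported)
FrontierSWE (Impl.): 340.0% (self-reported)
MiMo Coding Bench: 73.7% (self-reported)
TAU3-Bench: 72.9% (self-reported)
Terminal-Bench 2.0: 68.4% (self-reported)
Claw-Eval: 64.0% (self-reported)
SWE-Bench Pro: 57.2% (self-reported)
WildClawBench: 43.0% (self-reported)
Finance Agent v2: 41.5% (self-reported)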
Biology
GPQA: 66.7% (self-reported)
Code
SWE-Bench Verified: 78.9% (self-reported)
Finance
MMLU: 89.4% (self-reported)
MMLU-Pro: 68.5% (self-reported)
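General
ARC-C: 97.2% (self-reported)
MMLU-Redux: 92.8% (self-reported)
C-Eval: 91.5% (self-reported)
CMMLU: 90.2% (self-reported)
Global-MMLU: 83.6% (self-reported)
TriviaQA: 81.3% (self-reported)
MBPP+: 74.1% (self-reported)
LiveCodeBench v6: 39.6% (self-reported)
SWE-bench Verified (Agentless): 35.7% (self-reported)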
Language
BBH: 88.4% (self-reported)
Winogrande: 85.6% (self-reported)
Long Context
GraphWalks: 62.0% (self-reported)
Math
GSM8k: 99.6% (self-reported)
DROP: 86.3% (self-reported)
MATH: 86.2% (self-reported)
AIME: 37.3% (self-reported)
Humanity's Last Exam: 34.0% (self-reported)
Reasoning
HellaSwag: 89.8% (self-reported)
HumanEval+: 75.6% (self-reported)
AA evaluation indices
Coding Index: 60.2
Intelligence Index: 42.2
Tau2: 0.9
GPQA: 0.9
IFBench: 0.8
LCR: 0.7
Terminalbench V2 1: 0.7
SciCode: 0.5
Terminalbench Hard: 0.4
HLE: 0.3
Tau Banking: 0.1
LLM Stats category scores
Legal: 100
Finance: 100
Agents: 100
General: 100
Reasoning: 50
Language: 90
Math: 80
Frontend Development: 80
Healthcare: 80
Physics: 70
Biology: 70
Chemistry: 70
Code: 70
Tool Calling: 70
Long Context: 60
Coding: 60
Vision: 30
Pricing
Input price: $0.435 / 1M tokens
Output price: $0.87 / 1M tokens
Blended price (3:1): $0.544 / 1M tokens
Cache read price: $0.2 / 1M tokens
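The blended figure appears to be a token-weighted average of the input and output prices at a 3:1 input-to-output ratio. The sketch below assumes that interpretation; the function names `blended_price` and `request_cost` are illustrative and not part of any provider API.

```python
def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Token-weighted average price per 1M tokens (assumed definition of 'blended')."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Cost in USD of a single request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Prices listed on this page, in USD per 1M tokens.
INPUT_PRICE = 0.435
OUTPUT_PRICE = 0.87

blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended (3:1): ${blended:.3f} / 1M tokens")

# Hypothetical request: 30,000 input tokens and 10,000 output tokens.
example_cost = request_cost(30_000, 10_000, INPUT_PRICE, OUTPUT_PRICE)
print(f"Example request cost: ${example_cost:.5f}")
```

Under this assumption the computed blend is 0.54375, which rounds to the listed $0.544. Cache reads are not included in the sketch because the page does not say how cached tokens are counted in the blend.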
Speed
Tokens/second: 50.5
Time to first token: 1.86s
Time to first answer: 41.44s
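The speed figures can be combined into a rough response-time estimate. The sketch below assumes a simple linear model (time to first token plus output tokens divided by throughput) and further assumes that the gap between first token and first answer is spent generating tokens at the listed rate, for example reasoning tokens. Neither assumption is stated on this page, and the function names are illustrative.

```python
# Figures listed on this page.
TOKENS_PER_SECOND = 50.5
TIME_TO_FIRST_TOKEN_S = 1.86
TIME_TO_FIRST_ANSWER_S = 41.44


def estimated_generation_time(output_tokens):
    """Rough wall-clock seconds to stream `output_tokens` tokens (linear model)."""
    return TIME_TO_FIRST_TOKEN_S + output_tokens / TOKENS_PER_SECOND


def implied_tokens_before_answer():
    """Tokens that would be generated between first token and first answer,
    assuming the model emits tokens at the listed throughput during that gap."""
    return (TIME_TO_FIRST_ANSWER_S - TIME_TO_FIRST_TOKEN_S) * TOKENS_PER_SECOND


print(f"1,000 output tokens: ~{estimated_generation_time(1000):.1f}s")
print(f"Implied tokens before first answer: ~{implied_tokens_before_answer():.0f}")
```

Measured latency varies with provider, load, and prompt length, so these numbers are order-of-magnitude guides only.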
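Provider price ranking
3 providers
Cheapest: Xiaomi
Most expensive: AIHubMix
Provider, input price, output price (USD per 1M tokens):
1. Xiaomi (primary): $0.435 input, $0.87 output
2. routing.run: $0.45 input, $1.35 output
3. AIHubMix: $1.1 input, $3.3 output
Compare this model's pricing across API providers.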
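To compare providers on a single number, the listed input and output prices can be folded into a 3:1 blended price and ranked. This is a minimal sketch that assumes the provider prices are quoted per 1M tokens (the Xiaomi row matches the pricing section) and that the same 3:1 weighting applies to every provider; `PROVIDERS` and `blended` are illustrative names.

```python
# (input price, output price) in USD, assumed to be per 1M tokens.
PROVIDERS = {
    "Xiaomi": (0.435, 0.87),
    "routing.run": (0.45, 1.35),
    "AIHubMix": (1.1, 3.3),
}


def blended(input_price, output_price, input_parts=3, output_parts=1):
    """Token-weighted average price at an assumed 3:1 input-to-output ratio."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Rank providers from cheapest to most expensive by blended price.
ranked = sorted(PROVIDERS, key=lambda name: blended(*PROVIDERS[name]))
cheapest_price = blended(*PROVIDERS[ranked[0]])

for name in ranked:
    price = blended(*PROVIDERS[name])
    print(f"{name}: ${price:.3f} / 1M tokens ({price / cheapest_price:.2f}x cheapest)")
```

A workload with a different input-to-output mix can change the gaps between providers, so the ratio arguments are worth adjusting to match real traffic.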