MiMo-V2.5-Pro
Xiaomi開源權重MIT · 商用許可
描述
MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.
發布日期
2026-04-27
參數規模
1.0T
上下文長度
1.0M
支援模態
text
能力雷達圖
100
general
70
coding
80
reasoning
60
science估算
70
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Agents
GDPval-AA
1581.00 / 3000自報
FrontierSWE (Impl.)
340.0%自報
MiMo Coding Bench
73.7%自報
TAU3-Bench
72.9%自報
Terminal-Bench 2.0
68.4%自報
Claw-Eval
64.0%自報
SWE-Bench Pro
57.2%自報
WildClawBench
43.0%自報
Biology
GPQA
66.7%自報
Code
SWE-Bench Verified
78.9%自報
Finance
MMLU
89.4%自報
MMLU-Pro
68.5%自報
General
ARC-C
97.2%自報
MMLU-Redux
92.8%自報
C-Eval
91.5%自報
CMMLU
90.2%自報
Global-MMLU
83.6%自報
TriviaQA
81.3%自報
MBPP+
74.1%自報
LiveCodeBench v6
39.6%自報
SWE-bench Verified (Agentless)
35.7%自報
Language
BBH
88.4%自報
Winogrande
85.6%自報
Long Context
GraphWalks
62.0%自報
Math
GSM8k
99.6%自報
DROP
86.3%自報
MATH
86.2%自報
AIME
37.3%自報
Humanity's Last Exam
34.0%自報
Reasoning
HellaSwag
89.8%自報
HumanEval+
75.6%自報
AA 評測指數
暫無 AA 評測資料
LLM Stats 分類評分
Finance100
Legal100
Agents100
General100
Reasoning64
Language90
Frontend Development80
Healthcare80
Math80
Tool Calling70
Physics70
Biology70
Chemistry70
Code70
Long Context60
Coding60
Vision30
定價
輸入價格$0 / 1M tokens
輸出價格$0 / 1M tokens
混合價格(3:1)$0 / 1M tokens
快取讀取價格$0.2 / 1M tokens
速度
暫無速度資料
供應商價格排行
供應商價格排行
6 個供應商
最便宜: Xiaomi最貴: OpenCode Go
供應商輸入輸出
1Xiaomi主要
$0
$0
2DeepInfra
$0
$0
3Novita
$0
$0.00001
4CrofAI
$0.4
$0.8
5LLM Gateway
$1
$3
6OpenCode Go
$1.74
$3.48
比較該模型在不同 API 供應商之間的定價。