跳转到主要内容

MiMo-V2.5-Pro

Xiaomi开源权重MIT · 商用许可

描述

MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.

发布日期
2026-04-27
参数规模
1.0T
上下文长度
1.0M
支持模态
text

能力雷达图

100
general
70
coding
80
reasoning
60
science估算
70
agents
0
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体能力模型榜29
65.0
LS
推理能力23
83.0
LS

基准测试分数 (LLM Stats)

Agents

GDPval-AA1581.00 / 3000自报
FrontierSWE (Impl.)340.0%自报
MiMo Coding Bench73.7%自报
TAU3-Bench72.9%自报
Terminal-Bench 2.068.4%自报
Claw-Eval64.0%自报
SWE-Bench Pro57.2%自报
WildClawBench43.0%自报

Biology

GPQA66.7%自报

Code

SWE-Bench Verified78.9%自报

Finance

MMLU89.4%自报
MMLU-Pro68.5%自报

General

ARC-C97.2%自报
MMLU-Redux92.8%自报
C-Eval91.5%自报
CMMLU90.2%自报
Global-MMLU83.6%自报
TriviaQA81.3%自报
MBPP+74.1%自报
LiveCodeBench v639.6%自报
SWE-bench Verified (Agentless)35.7%自报

Language

BBH88.4%自报
Winogrande85.6%自报

Long Context

GraphWalks62.0%自报

Math

GSM8k99.6%自报
DROP86.3%自报
MATH86.2%自报
AIME37.3%自报
Humanity's Last Exam34.0%自报

Reasoning

HellaSwag89.8%自报
HumanEval+75.6%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Finance
100
Legal
100
Agents
100
General
100
Reasoning
64
Language
90
Frontend Development
80
Healthcare
80
Math
80
Tool Calling
70
Physics
70
Biology
70
Chemistry
70
Code
70
Long Context
60
Coding
60
Vision
30

定价

输入价格$0 / 1M tokens
输出价格$0 / 1M tokens
混合价格(3:1)$0 / 1M tokens
缓存读取价格$0.2 / 1M tokens

速度

暂无速度数据

供应商价格排行

供应商价格排行

6 个供应商

最便宜: Xiaomi最贵: OpenCode Go
供应商输入输出
1Xiaomi主要
$0
$0
2DeepInfra
$0
$0
3Novita
$0
$0.00001
4CrofAI
$0.4
$0.8
5LLM Gateway
$1
$3
6OpenCode Go
$1.74
$3.48

比较该模型在不同 API 供应商之间的定价。

外部链接