MiMo-V2.5-Pro

Xiaomi

Description

MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.

Release date

2026-04-22

Parameters

1.02T total, 42B active

Context length

1.0M tokens

Supported modalities

text

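Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal.

The science score is estimated from reasoning ability as a proxy when no dedicated science evaluation is available.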

Leaderboard rankings

Domain	Rank	Score	Source
Coding leaderboard	25	81.0	AA
General leaderboard	15	83.0	AA
Science	26	79.0	AA

Benchmark scores (LLM Stats)

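Category	Benchmark	Score	Source
Agents	GDPval-AA	1286.00 / 3000	self-reported
Agents	FrontierSWE (Impl.)	340.0%	self-reported
Agents	MiMo Coding Bench	73.7%	self-reported
Agents	TAU3-Bench	72.9%	self-reported
Agents	Terminal-Bench 2.0	68.4%	self-reported
Agents	Claw-Eval	64.0%	self-reported
Agents	SWE-Bench Pro	57.2%	self-reported
Agents	WildClawBench	43.0%	self-reported
Agents	Finance Agent v2	41.5%	self-reported
Biology	GPQA	66.7%	self-reported
Code	SWE-Bench Verified	78.9%	self-reported
Finance	MMLU	89.4%	self-reported
Finance	MMLU-Pro	68.5%	self-reported
General	ARC-C	97.2%	self-reported
General	MMLU-Redux	92.8%	self-reported
General	C-Eval	91.5%	self-reported
General	CMMLU	90.2%	self-reported
General	Global-MMLU	83.6%	self-reported
General	TriviaQA	81.3%	self-reported
General	MBPP+	74.1%	self-reported
General	LiveCodeBench v6	39.6%	self-reported
General	SWE-bench Verified (Agentless)	35.7%	self-reported
Language	BBH	88.4%	self-reported
Language	Winogrande	85.6%	self-reported
Long Context	GraphWalks	62.0%	self-reported
Math	GSM8k	99.6%	self-reported
Math	DROP	86.3%	self-reported
Math	MATH	86.2%	self-reported
Math	AIME	37.3%	self-reported
Math	Humanity's Last Exam	34.0%	self-reported
Reasoning	HellaSwag	89.8%	self-reported
Reasoning	HumanEval+	75.6%	self-reported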

AA evaluation indices

Index	Value
Coding Index	60.2
Intelligence Index	42.2
Tau2	0.9
GPQA	0.9
IFBench	0.8
LCR	0.7
Terminalbench V2 1	0.7
SciCode	0.5
Terminalbench Hard	0.4
HLE	0.3
Tau Banking	0.1

LLM Stats category scores

Category	Score
Legal	100
Finance	100
Agents	100
General	100

Categories listed without a score: Reasoning, Language, Math, Frontend Development, Healthcare, Physics, Biology, Chemistry, Code, Tool Calling, Long Context, Coding, Vision.

Pricing

Input price: $0.435 / 1M tokens

Output price: $0.87 / 1M tokens

Blended price (3:1): $0.544 / 1M tokens

Cache read price: $0.2 / 1M tokens
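The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens are weighted for every output token, which is how such blends are commonly defined; the page itself does not spell this out. The function names `blended_price` and `request_cost` are illustrative, not part of any provider API.

```python
# Prices in USD per 1M tokens, as listed on this page.
INPUT_PRICE = 0.435
OUTPUT_PRICE = 0.87


def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens (assumed input:output = 3:1)."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Cost in USD of one request at per-1M-token list prices (no cache discount)."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended price: ${blended:.3f} / 1M tokens")

# Hypothetical request: 200k input tokens, 4k output tokens.
print(f"Example request cost: ${request_cost(200_000, 4_000, INPUT_PRICE, OUTPUT_PRICE):.4f}")
```

Under that assumption the result is (3 × 0.435 + 0.87) / 4 = 0.54375, which rounds to the listed $0.544.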

Speed

Tokens per second: 50.5

Time to first token: 1.86 s

Time to first answer: 41.44 s
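These three figures can be combined into a rough latency estimate. The sketch below is a simplification with two assumptions that the page does not confirm: output is generated at a constant 50.5 tokens per second, and the gap between first token and first answer is spent emitting reasoning tokens at that same rate. The function names are illustrative.

```python
# Speed figures as listed on this page.
TOKENS_PER_SECOND = 50.5
TIME_TO_FIRST_TOKEN_S = 1.86
TIME_TO_FIRST_ANSWER_S = 41.44


def estimated_generation_time(output_tokens,
                              ttft=TIME_TO_FIRST_TOKEN_S,
                              tps=TOKENS_PER_SECOND):
    """Seconds to stream `output_tokens`, assuming a constant decode rate."""
    return ttft + output_tokens / tps


def implied_tokens_before_answer(ttfa=TIME_TO_FIRST_ANSWER_S,
                                 ttft=TIME_TO_FIRST_TOKEN_S,
                                 tps=TOKENS_PER_SECOND):
    """Tokens emitted between first token and first answer (assumed reasoning)."""
    return (ttfa - ttft) * tps


print(f"505 output tokens: about {estimated_generation_time(505):.2f} s")
print(f"Implied tokens before first answer: about {implied_tokens_before_answer():.0f}")
```

Under these assumptions, the 39.58 s gap corresponds to roughly 2,000 tokens generated before the first answer token appears.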

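Provider price ranking

3 providers

Cheapest: Xiaomi. Most expensive: AIHubMix.

Rank	Provider	Input (per 1M tokens)	Output (per 1M tokens)
1	Xiaomi (primary)	$0.435	$0.87
2	routing.run	$0.45	$1.35
3	AIHubMix	$1.1	$3.3

Compare this model's pricing across API providers.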
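One way to compare the three providers on a single number is to compute a blended price for each. This is a minimal sketch that assumes the same 3:1 input-to-output weighting used for the headline blended price and treats all listed prices as USD per 1M tokens; the `PROVIDERS` mapping and the `blended` helper are illustrative names.

```python
# (input, output) list prices in USD per 1M tokens, as listed on this page.
PROVIDERS = {
    "Xiaomi": (0.435, 0.87),
    "routing.run": (0.45, 1.35),
    "AIHubMix": (1.1, 3.3),
}


def blended(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens (assumed input:output = 3:1)."""
    return (input_weight * input_price + output_weight * output_price) / (
        input_weight + output_weight
    )


blended_by_provider = {name: blended(*prices) for name, prices in PROVIDERS.items()}
ranking = sorted(blended_by_provider, key=blended_by_provider.get)
cheapest = blended_by_provider[ranking[0]]

for name in ranking:
    price = blended_by_provider[name]
    print(f"{name}: ${price:.3f} / 1M tokens ({price / cheapest:.2f}x cheapest)")
```

With this weighting the ordering matches the ranking listed on the page: Xiaomi, then routing.run, then AIHubMix.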

External links

Artificial Analysis