MiMo-V2.5-Pro (Non-reasoning)

Xiaomi

Description

MiMo-V2.5-Pro is Xiaomi's 1.02T-parameter sparse Mixture-of-Experts language model with 42B active parameters and a 1M-token context window. It inherits the MiMo-V2-Flash hybrid-attention and Multi-Token Prediction design, extends context during pre-training up to 1M tokens, and uses supervised fine-tuning, domain-specialized reinforcement learning, and Multi-Teacher On-Policy Distillation to improve complex software engineering, long-horizon agentic tasks, and ultra-long-context coherence.

Release Date

2026-04-22

Parameters

—

Context Length

1.0M

Modalities

text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	175	52.0	AA
General Ranking	182	52.0	AA
Science	134	56.0	AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA

1286.00 / 3000SR

FrontierSWE (Impl.)

340.0%SR

MiMo Coding Bench

73.7%SR

TAU3-Bench

72.9%SR

Terminal-Bench 2.0

68.4%SR

Claw-Eval

64.0%SR

SWE-Bench Pro

57.2%SR

WildClawBench

43.0%SR

Finance Agent v2

41.5%SR

Biology

GPQA

66.7%SR

Code

SWE-Bench Verified

78.9%SR

Finance

MMLU

89.4%SR

MMLU-Pro

68.5%SR

General

ARC-C

97.2%SR

MMLU-Redux

92.8%SR

C-Eval

91.5%SR

CMMLU

90.2%SR

Global-MMLU

83.6%SR

TriviaQA

81.3%SR

MBPP+

74.1%SR

LiveCodeBench v6

39.6%SR

SWE-bench Verified (Agentless)

35.7%SR

Language

BBH

88.4%SR

Winogrande

85.6%SR

Long Context

GraphWalks

62.0%SR

Math

GSM8k

99.6%SR

DROP

86.3%SR

MATH

86.2%SR

AIME

37.3%SR

Humanity's Last Exam

34.0%SR

Reasoning

HellaSwag

89.8%SR

HumanEval+

75.6%SR

AA Evaluation Indices

Intelligence Index

27.9

Gpqa

0.8

Tau2

0.7

Ifbench

0.4

Scicode

0.4

Terminalbench Hard

0.4

Lcr

0.3

Hle

0.1

LLM Stats Category Scores

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Language

Math

Frontend Development

Healthcare

Physics

Biology

Chemistry

Code

Tool Calling

Long Context

Coding

Vision

Pricing

Input Price$0.9 / 1M tokens

Output Price$2.7 / 1M tokens

Blended Price (3:1)$1.35 / 1M tokens

Cache Read Price$0.2 / 1M tokens

Speed

Tokens/sec53.5

Time to First Token1.67s

Time to Answer1.67s

Provider Price Ranking

9 providers

Cheapest: NanoGPTMost Expensive: NovitaAI

ProviderInputOutput

1NanoGPTCheapest

$0.435

$0.87

2OpenRouter

$0.435

$0.87

3Vercel AI Gateway

$0.435

$0.87

4routing.run

$0.45

$1.35

5XiaomiPRIMARY

$0.9

$2.7

6ZenMux

7Deep Infra

8Kilo Gateway

9NovitaAI

Compare pricing across different API providers for this model.

External Sources

Artificial Analysis