MiMo-V2.5

XiaomiOpen WeightMIT · Commercial OK

Description

MiMo-V2.5 is Xiaomi's native omnimodal sparse Mixture-of-Experts model with 310B total parameters, 15B activated parameters, and a 1M-token context window. Built on the MiMo-V2-Flash backbone, it adds dedicated vision and audio encoders for text, image, video, and audio understanding, and is post-trained with SFT, agentic reinforcement learning, and Multi-Teacher On-Policy Distillation for multimodal perception, long-context reasoning, and agentic workflows.

Release Date

2026-04-22

Parameters

310.8B

Context Length

1.0M

Modalities

audio, image, text, video

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	75	54.0	LS
Multimodal Ranking	32	81.0	LS

Benchmark Scores (LLM Stats)

Agents

MiMo Coding Bench

71.8%SR

Terminal-Bench 2.0

65.8%SR

Claw-Eval

63.2%SR

SWE-Bench Pro

56.1%SR

ResearchClawBench

16.9%SR

Document Understanding

OmniDocBench

87.2%SR

General

MMMU-Pro

77.9%SR

Long Context

GraphWalks

87.0%SR

Multimodal

HR-Bench (4k)

88.5%SR

Video-MME

87.7%SR

DailyOmni

83.5%SR

CharXiv-R

81.0%SR

VideoHolmes

64.0%SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Long Context

Reasoning

General

Multimodal

Vision

Tool Calling

Agents

Code

Coding

Pricing

Input Price$0 / 1M tokens

Output Price$0 / 1M tokens

Blended Price (3:1)$0 / 1M tokens

Cache Read Price$0.08 / 1M tokens

Speed

No speed data available

Provider Price Ranking

6 providers

Cheapest: XiaomiMost Expensive: LLM Gateway

ProviderInputOutput

1XiaomiPRIMARY

2Novita

3DeepInfra

4OpenCode Go

$0.14

$0.28

5Venice AI

$0.175

$0.35

6LLM Gateway

$0.4

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis