Passer au contenu principal

MiniMax-M2.5

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

Description

MiniMax M2.5 is the world's first production-level model designed natively for Agent scenarios. Building on the M2.1 foundation, M2.5 delivers significant improvements in programming, tool calling, search, and office productivity. With only 10B activation parameters from its 230B MoE architecture, it achieves competitive performance against top international models like Claude Opus 4.6 while maintaining high throughput and efficient inference. M2.5 supports full-stack development for PC, App, and cross-platform applications, and excels in agentic workflows including automated customer support, data-analysis pipelines, and complex task execution.

Date de sortie
2026-02-12
Paramètres
230.0B
Longueur du contexte
197K
Modalités
image, text

Radar de capacités

37
general
38
coding
85
reasoning
57
scienceest.
70
agents
60
multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine#RangScoreSource
Agents & Tools27
65.0
LS
Code Ranking68
68.0
AA
General Ranking46
79.0
AA
Science71
67.0
AA

Scores de benchmarks (LLM Stats)

Agents

BrowseComp76.3%Aut.
MEWC74.4%Aut.
SWE-Bench Pro55.4%Aut.
VIBE-Pro54.2%Aut.

Code

SWE-Bench Verified80.2%Aut.
Multi-SWE-Bench51.3%Aut.

Finance

GDPval-MM59.0%Aut.

General

BFCL_v3_MultiTurn76.8%Aut.

Indices d'évaluation AA

Intelligence Index
41.9
Coding Index
37.4
Tau2
1.0
Gpqa
0.8
Ifbench
0.7
Lcr
0.7
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.2

Scores par catégorie LLM Stats

Frontend Development
80
Search
80
Agents
70
Code
60
Finance
60
General
60
Multimodal
60
Reasoning
60

Tarification

Prix d'entrée$0.3 / 1M tokens
Prix de sortie$1.2 / 1M tokens
Prix mixte (3:1)$0.525 / 1M tokens

Vitesse

Tokens/sec84.0 tokens/s
Délai du premier token1.44s
Temps de réponse25.25s

Fournisseurs disponibles

(Unités internes LS)
FournisseurPrix d'entréePrix de sortie
MiniMax300K1.2M

Sources externes