Passer au contenu principal

MiniMax-M2

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

Description

MiniMax M2 is an open-source large language model by MiniMax, built for agents and coding tasks. It delivers state-of-the-art tool use, reasoning, and search performance while maintaining exceptional cost-efficiency and speed, priced at just 8% of Claude 3.5 Sonnet’s cost and running at nearly double its inference speed (≈100 TPS). Designed for end-to-end agentic workflows, it excels at long-chain tool calling across Shell, Browser, Python, and other MCP tools. While slightly behind top overseas models in programming, it ranks among the best domestic models and top five globally on the Artificial Analysis benchmark. M2 powers the MiniMax Agent platform, available in Lightning Mode for fast tasks and Pro Mode for complex multi-step reasoning, and its weights, API, and deployment guides are freely available on Hugging Face, vLLM, and SGLang.

Date de sortie
2025-10-26
Paramètres
230.0B
Longueur du contexte
197K
Modalités
text

Radar de capacités

46
general
49
coding
78
reasoning
50
scienceest.
80
agents
0
multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine#RangScoreSource
Agents & Tools47
58.0
LS
Code Ranking98
62.0
AA
General Ranking64
76.0
AA
Math Reasoning92
79.0
AA
Reasoning89
49.0
LS
Science136
56.0
AA

Scores de benchmarks (LLM Stats)

Agents

Tau-bench77.2%Aut.
Terminal-Bench46.3%Aut.
BrowseComp44.0%Aut.

Biology

GPQA78.0%Aut.
SciCode36.0%Aut.

Code

LiveCodeBench83.0%Aut.
SWE-Bench Verified69.4%Aut.
SWE-bench Multilingual56.5%Aut.
Multi-SWE-Bench36.2%Aut.

Communication

Tau2 Telecom87.0%Aut.

Finance

MMLU-Pro82.0%Aut.

General

IF72.0%Aut.
AA-Index61.0%Aut.

Math

AIME 202578.0%Aut.
Humanity's Last Exam12.5%Aut.

Reasoning

BrowseComp-zh48.5%Aut.

Indices d'évaluation AA

Math Index
78.3
Intelligence Index
36.1
Coding Index
29.2
Tau2
0.9
Livecodebench
0.8
Mmlu Pro
0.8
Aime 25
0.8
Gpqa
0.8
Ifbench
0.7
Lcr
0.6
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.1

Scores par catégorie LLM Stats

Communication
90
Tool Calling
80
Finance
80
General
80
Healthcare
80
Language
80
Legal
80
Frontend Development
70
Agents
60
Biology
60
Chemistry
60
Physics
60
Reasoning
60
Code
50
Math
50
Search
50
Vision
10

Tarification

Prix d'entrée$0.3 / 1M tokens
Prix de sortie$1.2 / 1M tokens
Prix mixte (3:1)$0.525 / 1M tokens

Vitesse

Tokens/sec85.3 tokens/s
Délai du premier token1.24s
Temps de réponse24.68s

Fournisseurs disponibles

(Unités internes LS)
FournisseurPrix d'entréePrix de sortie
Novita300K1.2M
MiniMax300K1.2M

Sources externes