Passer au contenu principal

MiniMax-M2.1

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

Description

MiniMax M2.1 is an enhanced large language model focused on multi-language programming and real-world complex tasks. It features exceptional capabilities across Rust, Java, Golang, C++, Kotlin, Objective-C, TypeScript, JavaScript and more, with industry-leading multilingual performance that outperforms Claude Sonnet 4.5 and approaches Claude Opus 4.5. M2.1 significantly strengthens native Android and iOS development, delivers enhanced design comprehension and aesthetic expression for web/app scenarios, and provides more concise responses with improved speed and reduced token consumption. It excels across various coding agent frameworks including Claude Code, Droid (Factory AI), Cline, Kilo Code, Roo Code, and BlackBox.

Date de sortie
2025-12-23
Paramètres
230.0B
Longueur du contexte
197K
Modalités
text

Radar de capacités

51
general
50
coding
83
reasoning
56
scienceest.
70
agents
0
multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine#RangScoreSource
Agents & Tools73
51.0
LS
Code Ranking83
65.0
AA
General Ranking54
78.0
AA
Math Reasoning70
84.0
AA
Science62
68.0
AA

Scores de benchmarks (LLM Stats)

Agents

BrowseComp62.0%Aut.
Terminal-Bench47.9%Aut.
Toolathlon43.5%Aut.

Biology

GPQA81.0%Aut.
SciCode39.0%Aut.

Code

VIBE Web91.5%Aut.
VIBE Android89.7%Aut.
VIBE88.6%Aut.
VIBE iOS88.0%Aut.
VIBE Simulation87.1%Aut.
VIBE Backend86.7%Aut.
LiveCodeBench78.0%Aut.
SWE-bench Multilingual72.5%Aut.
SWT-Bench69.3%Aut.
SWE-Bench Verified67.0%Aut.
Multi-SWE-Bench49.4%Aut.
OctoCodingBench26.1%Aut.
SWE-Review8.9%Aut.
SWE-Perf3.1%Aut.

Communication

Tau2 Telecom87.0%Aut.

Finance

MMLU-Pro88.0%Aut.

General

IFBench70.0%Aut.

Long Context

AA-LCR62.0%Aut.

Math

AIME 202581.0%Aut.
Humanity's Last Exam22.0%Aut.

Indices d'évaluation AA

Math Index
82.7
Intelligence Index
39.4
Coding Index
32.8
Mmlu Pro
0.9
Tau2
0.9
Gpqa
0.8
Aime 25
0.8
Livecodebench
0.8
Ifbench
0.7
Lcr
0.6
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.2

Scores par catégorie LLM Stats

Communication
90
Finance
90
Healthcare
90
Language
90
Legal
90
General
80
Tool Calling
70
Frontend Development
70
Instruction Following
70
Biology
60
Chemistry
60
Code
60
Long Context
60
Math
60
Physics
60
Reasoning
60
Search
60
Agents
50
Vision
20

Tarification

Prix d'entrée$0.3 / 1M tokens
Prix de sortie$1.2 / 1M tokens
Prix mixte (3:1)$0.525 / 1M tokens

Vitesse

Tokens/sec86.3 tokens/s
Délai du premier token1.30s
Temps de réponse24.49s

Fournisseurs disponibles

(Unités internes LS)
FournisseurPrix d'entréePrix de sortie
MiniMax300K1.2M

Sources externes