Skip to main content

MiniMax-M2.7

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

Description

MiniMax M2.7 features model self-improvement driving productivity innovation. It builds complex agent harnesses independently to accomplish highly complex productivity tasks. M2.7 demonstrates excellent performance in real-world software engineering including end-to-end project delivery, log analysis, code security, and ML tasks. On SWE-Pro it scores 56.22%, nearly matching Opus. It excels in professional office domains achieving the highest ELO among open-source models on GDPval-AA (1495), with significant improvement in complex editing for Office Suite. M2.7 maintains 97% skill adherence on 40 complex skills cases.

Release Date
2026-03-18
Parameters
Context Length
197K
Modalities
text

Capability Radar

45
general
43
coding
87
reasoning
61
scienceest.
50
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Agents & Tools60
54.0
LS
Code Ranking39
75.0
AA
General Ranking25
85.0
AA
Science31
78.0
AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA1494.00 / 3000SR
MLE-Bench Lite66.6%SR
MM-ClawBench62.7%SR
Terminal-Bench 2.057.0%SR
SWE-Bench Pro56.2%SR
VIBE-Pro55.6%SR
Toolathlon46.3%SR
NL2Repo39.8%SR

Code

SWE-bench Multilingual76.5%SR
Multi-SWE-Bench52.7%SR

General

Artificial Analysis50.0%SR

AA Evaluation Indices

Intelligence Index
49.6
Coding Index
41.9
Gpqa
0.9
Tau2
0.8
Ifbench
0.8
Lcr
0.7
Scicode
0.5
Terminalbench Hard
0.4
Hle
0.3

LLM Stats Category Scores

Finance
100
General
100
Legal
100
Agents
100
Reasoning
100
Code
60
Tool Calling
50
Coding
40

Pricing

Input Price$0.3 / 1M tokens
Output Price$1.2 / 1M tokens
Blended Price (3:1)$0.525 / 1M tokens

Speed

Tokens/sec48.6 tokens/s
Time to First Token1.43s
Time to Answer52.07s

Available Providers

(LS internal units)
ProviderInput PriceOutput Price
MiniMax300K1.2M
Fireworks300K1.2M
Novita300K1.2M

External Sources