Skip to main content

MiniMax M1 80k

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

Description

MiniMax-M1 is an open-source, large-scale reasoning model that uses a hybrid-attention architecture for efficient long-context processing. It supports up to a 1 million token context window and 80,000-token reasoning output, matching Gemini 2.5 Pro’s scale while being highly cost-effective. Its Lightning Attention mechanism reduces compute requirements to about 30% of DeepSeek R1’s, and a new reinforcement learning algorithm, CISPO, doubles convergence speed compared to other RL methods. Trained on 512 H800s over three weeks, M1 achieves near state-of-the-art results across software engineering, long-context, and tool-use benchmarks, outperforming most open models and rivaling top closed systems.

Release Date
2025-06-17
Parameters
456.0B
Context Length
1.0M
Modalities
text

Capability Radar

39
general
37
coding
73
reasoning
46
scienceest.
60
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking202
41.0
AA
General Ranking220
47.0
AA
Math Reasoning102
75.0
AA
Reasoning17
87.0
LS
Science180
50.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA70.0%SR

Code

LiveCodeBench65.0%SR
SWE-Bench Verified56.0%SR

Communication

TAU-bench Retail63.5%SR
TAU-bench Airline62.0%SR
Multi-Challenge44.7%SR

Factuality

SimpleQA18.5%SR

Finance

MMLU-Pro81.1%SR

General

LongBench v261.5%SR

Long Context

OpenAI-MRCR: 2 needle 128k73.4%SR
OpenAI-MRCR: 2 needle 1M56.2%SR

Math

MATH-50096.8%SR
AIME 202486.0%SR
AIME 202576.9%SR
Humanity's Last Exam8.4%SR

Reasoning

ZebraLogic86.8%SR

AA Evaluation Indices

Math Index
61.0
Intelligence Index
24.4
Coding Index
14.5
Math 500
1.0
Aime
0.8
Mmlu Pro
0.8
Livecodebench
0.7
Gpqa
0.7
Aime 25
0.6
Lcr
0.5
Ifbench
0.4
Scicode
0.4
Tau2
0.3
Hle
0.1
Terminalbench Hard
0.0

LLM Stats Category Scores

Finance
80
Healthcare
80
Language
80
Legal
80
Biology
70
Chemistry
70
Math
70
Physics
70
Structured Output
60
Tool Calling
60
Code
60
Communication
60
Frontend Development
60
General
60
Long Context
60
Reasoning
60
Factuality
20
Vision
10

Pricing

Input Price$0.55 / 1M tokens
Output Price$2.2 / 1M tokens
Blended Price (3:1)$0.963 / 1M tokens

Speed

Tokens/sec0.0 tokens/s
Time to First Token0.00s
Time to Answer0.00s

Available Providers

(LS internal units)

No provider data available

External Sources