MiniMax M1 40k
MiniMaxMiniMaxOpen WeightMIT · Commercial OK
描述
MiniMax-M1 is an open-source, large-scale reasoning model that uses a hybrid-attention architecture for efficient long-context processing. It supports up to a 1 million token context window and 80,000-token reasoning output, matching Gemini 2.5 Pro’s scale while being highly cost-effective. Its Lightning Attention mechanism reduces compute requirements to about 30% of DeepSeek R1’s, and a new reinforcement learning algorithm, CISPO, doubles convergence speed compared to other RL methods. Trained on 512 H800s over three weeks, M1 achieves near state-of-the-art results across software engineering, long-context, and tool-use benchmarks, outperforming most open models and rivaling top closed systems.
發布日期
2025-06-17
參數規模
456.0B
上下文長度
1.0M
支援模態
text
能力雷達圖
37
general
35
coding
49
reasoning
45
science估算
60
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Biology
GPQA
69.2%自報
Code
LiveCodeBench
62.3%自報
SWE-Bench Verified
55.6%自報
Communication
TAU-bench Retail
67.8%自報
TAU-bench Airline
60.0%自報
Multi-Challenge
44.7%自報
Factuality
SimpleQA
17.9%自報
Finance
MMLU-Pro
80.6%自報
General
LongBench v2
61.0%自報
Long Context
OpenAI-MRCR: 2 needle 128k
76.1%自報
OpenAI-MRCR: 2 needle 1M
58.6%自報
Math
MATH-500
96.0%自報
AIME 2024
83.3%自報
AIME 2025
74.6%自報
Humanity's Last Exam
7.2%自報
Reasoning
ZebraLogic
80.1%自報
AA 評測指數
Intelligence Index20.9
Coding Index14.1
Math Index13.7
Math 5001.0
Aime0.8
Mmlu Pro0.8
Gpqa0.7
Livecodebench0.7
Lcr0.5
Ifbench0.4
Scicode0.4
Tau20.3
Aime 250.1
Hle0.1
Terminalbench Hard0.0
LLM Stats 分類評分
Finance80
Healthcare80
Language80
Legal80
Biology70
Chemistry70
Long Context70
Math70
Physics70
Structured Output60
Tool Calling60
Code60
Communication60
Frontend Development60
General60
Reasoning60
Factuality20
Vision10
定價
輸入價格免費
輸出價格免費
混合價格(3:1)免費
速度
Tokens/秒0.0 tokens/s
首Token延遲0.00s
首回答延遲0.00s
可用提供商
(LS 內部計價單位)暫無提供商資料