MiniMax M1 40k
MiniMaxMiniMaxOpen WeightMIT · Commercial OK
描述
MiniMax-M1 is an open-source, large-scale reasoning model that uses a hybrid-attention architecture for efficient long-context processing. It supports up to a 1 million token context window and 80,000-token reasoning output, matching Gemini 2.5 Pro’s scale while being highly cost-effective. Its Lightning Attention mechanism reduces compute requirements to about 30% of DeepSeek R1’s, and a new reinforcement learning algorithm, CISPO, doubles convergence speed compared to other RL methods. Trained on 512 H800s over three weeks, M1 achieves near state-of-the-art results across software engineering, long-context, and tool-use benchmarks, outperforming most open models and rivaling top closed systems.
发布日期
2025-06-17
参数规模
456.0B
上下文长度
1.0M
支持模态
text
能力雷达图
37
general
35
coding
49
reasoning
45
science估算
60
agents
0
multimodal
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
基准测试分数 (LLM Stats)
Biology
GPQA
69.2%自报
Code
LiveCodeBench
62.3%自报
SWE-Bench Verified
55.6%自报
Communication
TAU-bench Retail
67.8%自报
TAU-bench Airline
60.0%自报
Multi-Challenge
44.7%自报
Factuality
SimpleQA
17.9%自报
Finance
MMLU-Pro
80.6%自报
General
LongBench v2
61.0%自报
Long Context
OpenAI-MRCR: 2 needle 128k
76.1%自报
OpenAI-MRCR: 2 needle 1M
58.6%自报
Math
MATH-500
96.0%自报
AIME 2024
83.3%自报
AIME 2025
74.6%自报
Humanity's Last Exam
7.2%自报
Reasoning
ZebraLogic
80.1%自报
AA 评测指数
Intelligence Index20.9
Coding Index14.1
Math Index13.7
Math 5001.0
Aime0.8
Mmlu Pro0.8
Gpqa0.7
Livecodebench0.7
Lcr0.5
Ifbench0.4
Scicode0.4
Tau20.3
Aime 250.1
Hle0.1
Terminalbench Hard0.0
LLM Stats 分类评分
Finance80
Healthcare80
Language80
Legal80
Biology70
Chemistry70
Long Context70
Math70
Physics70
Structured Output60
Tool Calling60
Code60
Communication60
Frontend Development60
General60
Reasoning60
Factuality20
Vision10
定价
输入价格免费
输出价格免费
混合价格(3:1)免费
速度
Tokens/秒0.0 tokens/s
首Token延迟0.00s
首回答延迟0.00s
可用提供商
(LS 内部计价单位)暂无提供商数据