MiniMax-M2.7
MiniMax · Open Weight · MIT · Commercial OK
Description
MiniMax M2.7 emphasizes model self-improvement as a driver of productivity gains. It can independently build complex agent harnesses to complete highly complex productivity tasks. M2.7 performs well in real-world software engineering, including end-to-end project delivery, log analysis, code security, and ML tasks. On SWE-Pro it scores 56.22%, nearly matching Opus. In professional office domains it achieves the highest ELO among open-source models on GDPval-AA (1495), with significant improvement in complex editing for Office Suite. M2.7 maintains 97% skill adherence across 40 complex skill cases.
Release date
2026-03-18
Parameter count
—
Context length
197K
Supported modalities
text
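Capability radar chart
general: 45
coding: 43
reasoning: 87
science (estimated): 61
agents: 50
multimodal: 0
Note: when no dedicated science evaluation is available, Science is estimated using reasoning ability as a proxy.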
Leaderboard ranking
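Benchmark scores (LLM Stats)
Agents
GDPval-AA: 1494.00 / 3000 (self-reported)
MLE-Bench Lite: 66.6% (self-reported)
MM-ClawBench: 62.7% (self-reported)
Terminal-Bench 2.0: 57.0% (self-reported)
SWE-Bench Pro: 56.2% (self-reported)
VIBE-Pro: 55.6% (self-reported)
Toolathlon: 46.3% (self-reported)
NL2Repo: 39.8% (self-reported)
Code
SWE-bench Multilingual: 76.5% (self-reported)
Multi-SWE-Bench: 52.7% (self-reported)
General
Artificial Analysis: 50.0% (self-reported)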
AA evaluation indices
Intelligence Index: 49.6
Coding Index: 41.9
GPQA: 0.9
Tau2: 0.8
IFBench: 0.8
LCR: 0.7
SciCode: 0.5
Terminal-Bench Hard: 0.4
HLE: 0.3
LLM Stats category scores
Finance: 100
General: 100
Legal: 100
Agents: 100
Reasoning: 100
Code: 60
Tool Calling: 50
Coding: 40
Pricing
Input price: $0.3 / 1M tokens
Output price: $1.2 / 1M tokens
Blended price (3:1): $0.525 / 1M tokens
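The 3:1 blended figure above is consistent with a weighted average of three parts input price to one part output price. The sketch below reproduces that arithmetic and extends it to a per-request cost estimate. The function names `blended_price` and `request_cost` are hypothetical, and reading "3:1" as input:output is an assumption that happens to match the listed $0.525.

```python
# Listed prices in USD per 1M tokens (taken from the pricing lines above).
INPUT_PRICE_PER_M = 0.3
OUTPUT_PRICE_PER_M = 1.2


def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output ratio.

    Assumption: the "3:1" ratio means 3 parts input tokens to 1 part output.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = INPUT_PRICE_PER_M,
                 output_price: float = OUTPUT_PRICE_PER_M) -> float:
    """Estimated USD cost of one request at the listed per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    print(f"blended: ${blended_price(INPUT_PRICE_PER_M, OUTPUT_PRICE_PER_M):.3f} / 1M tokens")
    print(f"10k in + 2k out: ${request_cost(10_000, 2_000):.4f}")
```

For a workload with a different input-to-output mix, pass other values for `input_parts` and `output_parts` instead of relying on the 3:1 headline number.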
Speed
Tokens/second: 48.6 tokens/s
Time to first token: 1.43s
Time to first answer: 52.07s
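As a rough illustration of how the throughput and first-token latency figures combine, the sketch below estimates the wall-clock time to stream a response of a given length. It assumes a constant decode rate after the first token and ignores network and queueing effects, so it is an approximation rather than a measurement. The function name `estimated_response_seconds` is hypothetical.

```python
# Listed speed figures (taken from the lines above).
TOKENS_PER_SECOND = 48.6
FIRST_TOKEN_LATENCY_S = 1.43


def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = TOKENS_PER_SECOND,
                               first_token_latency: float = FIRST_TOKEN_LATENCY_S) -> float:
    """Approximate time to receive `output_tokens` tokens.

    Assumption: latency = time to first token + tokens / steady decode rate.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_latency + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 1_000, 4_000):
        print(f"{n} tokens: about {estimated_response_seconds(n):.1f} s")
```

The gap between the listed first-token and first-answer latencies suggests that tokens generated before the final answer also count toward the wait, so the estimate should be applied to total generated tokens, not only the visible answer.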
Available providers
(LS internal pricing units)

| Provider | Input price | Output price |
|---|---|---|
| MiniMax | 300K | 1.2M |
| Fireworks | 300K | 1.2M |
| Novita | 300K | 1.2M |