MiniMax-M2.7
MiniMax · Open Weight · MIT · Commercial OK
Description
MiniMax M2.7 is built around model self-improvement as a driver of productivity gains: it independently builds complex agent harnesses to carry out highly complex productivity tasks. M2.7 performs well in real-world software engineering, including end-to-end project delivery, log analysis, code security, and ML tasks. On SWE-Bench Pro it scores 56.22%, nearly matching Opus. In professional office domains it achieves the highest ELO among open-source models on GDPval-AA (1495), with significant improvement in complex editing for Office Suite. M2.7 maintains 97% skill adherence across 40 complex skill cases.
Release date: 2026-03-18
Parameters: not listed
Context length: 197K
Supported modalities: text
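Capability radar chart
general: 45
coding: 43
reasoning: 87
science (estimated): 61
agents: 50
multimodal: 0
Note: Science is estimated using reasoning ability as a proxy when no dedicated science evaluation is available.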
Leaderboard rankings
Benchmark scores (LLM Stats)
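| Category | Benchmark | Score | Source |
|---|---|---|---|
| Agents | GDPval-AA | 1494.00 / 3000 | Self-reported |
| Agents | MLE-Bench Lite | 66.6% | Self-reported |
| Agents | MM-ClawBench | 62.7% | Self-reported |
| Agents | Terminal-Bench 2.0 | 57.0% | Self-reported |
| Agents | SWE-Bench Pro | 56.2% | Self-reported |
| Agents | VIBE-Pro | 55.6% | Self-reported |
| Agents | Toolathlon | 46.3% | Self-reported |
| Agents | NL2Repo | 39.8% | Self-reported |
| Code | SWE-bench Multilingual | 76.5% | Self-reported |
| Code | Multi-SWE-Bench | 52.7% | Self-reported |
| General | Artificial Analysis | 50.0% | Self-reported |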
AA evaluation indices
Intelligence Index: 49.6
Coding Index: 41.9
GPQA: 0.9
Tau2: 0.8
IFBench: 0.8
LCR: 0.7
SciCode: 0.5
TerminalBench Hard: 0.4
HLE: 0.3
LLM Stats category scores
Finance: 100
General: 100
Legal: 100
Agents: 100
Reasoning: 100
Code: 60
Tool Calling: 50
Coding: 40
Pricing
Input price: $0.3 / 1M tokens
Output price: $1.2 / 1M tokens
Blended price (3:1): $0.525 / 1M tokens
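The listed 3:1 blended price is consistent with a weighted average of the input and output prices at three input tokens per output token. The sketch below reproduces that arithmetic; the function name `blended_price` and its parameters are hypothetical, and the weighting scheme is an assumption inferred from the "(3:1)" label rather than a documented formula.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output token ratio.

    Assumption: "3:1" means 3 parts input tokens to 1 part output tokens.
    """
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices in USD per 1M tokens.
price = blended_price(0.3, 1.2)
print(round(price, 3))  # 0.525
```

A workload with a different input-to-output ratio can be priced by changing `input_parts` and `output_parts`.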
Speed
Tokens/second: 48.6 tokens/s
Time to first token: 1.43s
Time to first answer: 52.07s
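As a rough illustration of how these figures combine, streaming time for a response can be approximated as time to first token plus output tokens divided by throughput. This is a minimal sketch under stated assumptions: throughput is treated as constant, and any reasoning tokens produced before the visible answer are ignored, so it will underestimate real latency when the gap between first token and first answer is large. The function name `estimate_response_seconds` is hypothetical.

```python
def estimate_response_seconds(output_tokens: int,
                              ttft_s: float = 1.43,
                              tokens_per_s: float = 48.6) -> float:
    """Approximate wall-clock time to stream `output_tokens` tokens.

    Assumption: constant throughput after the first token; defaults are
    the speed figures listed on this page.
    """
    return ttft_s + output_tokens / tokens_per_s


# Estimated time to stream 1,000 output tokens, in seconds.
print(round(estimate_response_seconds(1000), 1))
```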
Available providers
Prices are in LS internal pricing units.

| Provider | Input price | Output price |
|---|---|---|
| MiniMax | 300K | 1.2M |
| Fireworks | 300K | 1.2M |
| Novita | 300K | 1.2M |