跳轉到主要內容

MiniMax-M2.1

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

描述

MiniMax M2.1 is an enhanced large language model focused on multi-language programming and real-world complex tasks. It features exceptional capabilities across Rust, Java, Golang, C++, Kotlin, Objective-C, TypeScript, JavaScript and more, with industry-leading multilingual performance that outperforms Claude Sonnet 4.5 and approaches Claude Opus 4.5. M2.1 significantly strengthens native Android and iOS development, delivers enhanced design comprehension and aesthetic expression for web/app scenarios, and provides more concise responses with improved speed and reduced token consumption. It excels across various coding agent frameworks including Claude Code, Droid (Factory AI), Cline, Kilo Code, Roo Code, and BlackBox.

發布日期
2025-12-23
參數規模
230.0B
上下文長度
197K
支援模態
text

能力雷達圖

51
general
50
coding
83
reasoning
56
science估算
70
agents
0
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
智能体与工具73
51.0
LS
代码能力榜83
65.0
AA
通用能力榜54
78.0
AA
数学推理70
84.0
AA
科学能力62
68.0
AA

基準測試分數 (LLM Stats)

Agents

BrowseComp62.0%自報
Terminal-Bench47.9%自報
Toolathlon43.5%自報

Biology

GPQA81.0%自報
SciCode39.0%自報

Code

VIBE Web91.5%自報
VIBE Android89.7%自報
VIBE88.6%自報
VIBE iOS88.0%自報
VIBE Simulation87.1%自報
VIBE Backend86.7%自報
LiveCodeBench78.0%自報
SWE-bench Multilingual72.5%自報
SWT-Bench69.3%自報
SWE-Bench Verified67.0%自報
Multi-SWE-Bench49.4%自報
OctoCodingBench26.1%自報
SWE-Review8.9%自報
SWE-Perf3.1%自報

Communication

Tau2 Telecom87.0%自報

Finance

MMLU-Pro88.0%自報

General

IFBench70.0%自報

Long Context

AA-LCR62.0%自報

Math

AIME 202581.0%自報
Humanity's Last Exam22.0%自報

AA 評測指數

Math Index
82.7
Intelligence Index
39.4
Coding Index
32.8
Mmlu Pro
0.9
Tau2
0.9
Gpqa
0.8
Aime 25
0.8
Livecodebench
0.8
Ifbench
0.7
Lcr
0.6
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.2

LLM Stats 分類評分

Communication
90
Finance
90
Healthcare
90
Language
90
Legal
90
General
80
Tool Calling
70
Frontend Development
70
Instruction Following
70
Biology
60
Chemistry
60
Code
60
Long Context
60
Math
60
Physics
60
Reasoning
60
Search
60
Agents
50
Vision
20

定價

輸入價格$0.3 / 1M tokens
輸出價格$1.2 / 1M tokens
混合價格(3:1)$0.525 / 1M tokens

速度

Tokens/秒86.3 tokens/s
首Token延遲1.30s
首回答延遲24.49s

可用提供商

(LS 內部計價單位)
提供商輸入價格輸出價格
MiniMax300K1.2M

外部連結