MiniMax-M2

MiniMax · Open Weight · MIT · Commercial OK

Description

MiniMax M2 is an open-source large language model by MiniMax, built for agents and coding tasks. It delivers state-of-the-art tool use, reasoning, and search performance while remaining cost-efficient and fast: it is priced at just 8% of Claude 3.5 Sonnet’s cost and runs at nearly double its inference speed (≈100 TPS). Designed for end-to-end agentic workflows, it excels at long-chain tool calling across Shell, Browser, Python, and other MCP tools. Although it trails the top overseas models slightly in programming, it ranks among the best domestic models and in the top five globally on the Artificial Analysis benchmark. M2 powers the MiniMax Agent platform, which offers Lightning Mode for fast tasks and Pro Mode for complex multi-step reasoning. Its weights are freely available on Hugging Face, alongside API access and deployment guides for vLLM and SGLang.

Release date: 2025-10-26
Parameters: 230.0B
Context length: 197K
Modality: text

Capability radar

general: 46
coding: 49
reasoning: 78
science (estimated): 50
agents: 80
multimodal: 0

When specialized science benchmarks are unavailable, the Science score is estimated using a reasoning proxy.

Rankings

Domain            Rank   Score   Source
Agents & Tools    #47    58.0    LS
Code Ranking      #98    62.0    AA
General Ranking   #64    76.0    AA
Math Reasoning    #92    79.0    AA
Reasoning         #89    49.0    LS
Science           #136   56.0    AA

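Benchmark scores (LLM Stats)

Agents

Tau-bench: 77.2% (self-reported)
Terminal-Bench: 46.3% (self-reported)
BrowseComp: 44.0% (self-reported)

Biology

GPQA: 78.0% (self-reported)
SciCode: 36.0% (self-reported)

Code

LiveCodeBench: 83.0% (self-reported)
SWE-Bench Verified: 69.4% (self-reported)
SWE-bench Multilingual: 56.5% (self-reported)
Multi-SWE-Bench: 36.2% (self-reported)

Communication

Tau2 Telecom: 87.0% (self-reported)

Finance

MMLU-Pro: 82.0% (self-reported)

General

IF: 72.0% (self-reported)
AA-Index: 61.0% (self-reported)

Math

AIME 2025: 78.0% (self-reported)
Humanity's Last Exam: 12.5% (self-reported)

Reasoning

BrowseComp-zh: 48.5% (self-reported)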

AA evaluation indices

Math Index: 78.3
Intelligence Index: 36.1
Coding Index: 29.2
Tau2: 0.9
LiveCodeBench: 0.8
MMLU-Pro: 0.8
AIME 25: 0.8
GPQA: 0.8
IFBench: 0.7
LCR: 0.6
SciCode: 0.4
TerminalBench Hard: 0.3
HLE: 0.1

LLM Stats category scores

Communication: 90
Tool Calling: 80
Finance: 80
General: 80
Healthcare: 80
Language: 80
Legal: 80
Frontend Development: 70
Agents: 60
Biology: 60
Chemistry: 60
Physics: 60
Reasoning: 60
Code: 50
Math: 50
Search: 50
Vision: 10

Pricing

Input price: $0.3 / 1M tokens
Output price: $1.2 / 1M tokens
Blended price (3:1): $0.525 / 1M tokens
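
The blended figure can be reproduced as a weighted average of the two listed prices. The sketch below assumes the 3:1 ratio means three input tokens for every output token; `blended_price` is an illustrative helper, not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    return (input_price * input_weight + output_price * output_weight) / total_weight


# Listed prices in USD per 1M tokens: 0.3 for input, 1.2 for output.
print(round(blended_price(0.3, 1.2), 3))  # 0.525
```

Workloads with a different input:output mix can pass their own weights, for example `blended_price(0.3, 1.2, 1, 1)` for an even split.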

Speed

Tokens per second: 85.3 tokens/s
Time to first token: 1.24s
Time to first answer: 24.68s
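
The throughput and first-token latency give a rough way to estimate how long a completion takes. This is a minimal sketch, assuming tokens stream at a steady rate once the first token arrives; `estimate_response_seconds` is a hypothetical helper, and any reasoning tokens emitted before the visible answer would have to be included in `output_tokens`.

```python
def estimate_response_seconds(output_tokens: int,
                              tokens_per_second: float = 85.3,
                              first_token_latency_s: float = 1.24) -> float:
    """Rough end-to-end time: first-token latency plus steady-state generation."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_latency_s + output_tokens / tokens_per_second


# Example: a 1,000-token completion at the listed speed.
print(f"{estimate_response_seconds(1000):.2f} s")  # 12.96 s
```

Real latency varies with provider load and prompt length, so the result is best read as an order-of-magnitude guide.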

Available providers

(LS internal units)
Provider   Input price   Output price
Novita     300K          1.2M
MiniMax    300K          1.2M
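
The provider prices are in LS internal units, which the page does not define. The sketch below assumes one internal unit equals one millionth of a US dollar per 1M tokens. That assumption is an inference, chosen because it maps 300K and 1.2M onto the $0.3 and $1.2 figures in the pricing section. The helper name is hypothetical.

```python
# Assumption (not stated by the source): 1 internal unit = 1e-6 USD per 1M tokens.
_SUFFIXES = {"K": 1_000, "M": 1_000_000}


def internal_units_to_usd_per_million(value: str) -> float:
    """Convert a listed price such as '300K' or '1.2M' to USD per 1M tokens."""
    text = value.strip().upper()
    suffix = text[-1:]
    if suffix in _SUFFIXES:
        units = float(text[:-1]) * _SUFFIXES[suffix]
    else:
        units = float(text)
    return units / 1_000_000


for listed in ("300K", "1.2M"):
    print(listed, "->", internal_units_to_usd_per_million(listed))
```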
