Перейти к основному содержанию

MiniMax-M2

MiniMaxMiniMaxOpen WeightMIT · Commercial OK

Описание

MiniMax M2 is an open-source large language model by MiniMax, built for agents and coding tasks. It delivers state-of-the-art tool use, reasoning, and search performance while maintaining exceptional cost-efficiency and speed, priced at just 8% of Claude 3.5 Sonnet’s cost and running at nearly double its inference speed (≈100 TPS). Designed for end-to-end agentic workflows, it excels at long-chain tool calling across Shell, Browser, Python, and other MCP tools. While slightly behind top overseas models in programming, it ranks among the best domestic models and top five globally on the Artificial Analysis benchmark. M2 powers the MiniMax Agent platform, available in Lightning Mode for fast tasks and Pro Mode for complex multi-step reasoning, and its weights, API, and deployment guides are freely available on Hugging Face, vLLM, and SGLang.

Дата выхода
2025-10-26
Параметры
230.0B
Длина контекста
197K
Модальности
text

Радар способностей

46
general
49
coding
78
reasoning
50
scienceоцен.
80
agents
0
multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен#МестоОценкаИсточник
Agents & Tools47
58.0
LS
Code Ranking98
62.0
AA
General Ranking64
76.0
AA
Math Reasoning92
79.0
AA
Reasoning89
49.0
LS
Science136
56.0
AA

Оценки бенчмарков (LLM Stats)

Agents

Tau-bench77.2%Сам.
Terminal-Bench46.3%Сам.
BrowseComp44.0%Сам.

Biology

GPQA78.0%Сам.
SciCode36.0%Сам.

Code

LiveCodeBench83.0%Сам.
SWE-Bench Verified69.4%Сам.
SWE-bench Multilingual56.5%Сам.
Multi-SWE-Bench36.2%Сам.

Communication

Tau2 Telecom87.0%Сам.

Finance

MMLU-Pro82.0%Сам.

General

IF72.0%Сам.
AA-Index61.0%Сам.

Math

AIME 202578.0%Сам.
Humanity's Last Exam12.5%Сам.

Reasoning

BrowseComp-zh48.5%Сам.

Индексы оценки AA

Math Index
78.3
Intelligence Index
36.1
Coding Index
29.2
Tau2
0.9
Livecodebench
0.8
Mmlu Pro
0.8
Aime 25
0.8
Gpqa
0.8
Ifbench
0.7
Lcr
0.6
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.1

Оценки категорий LLM Stats

Communication
90
Tool Calling
80
Finance
80
General
80
Healthcare
80
Language
80
Legal
80
Frontend Development
70
Agents
60
Biology
60
Chemistry
60
Physics
60
Reasoning
60
Code
50
Math
50
Search
50
Vision
10

Цены

Цена ввода$0.3 / 1M tokens
Цена вывода$1.2 / 1M tokens
Смешанная цена (3:1)$0.525 / 1M tokens

Скорость

Токенов/сек85.3 tokens/s
Задержка первого токена1.24s
Время до первого ответа24.68s

Доступные провайдеры

(Внутренние единицы LS)
ПровайдерЦена вводаЦена вывода
Novita300K1.2M
MiniMax300K1.2M

Внешние ссылки