Перейти к основному содержанию

Qwen3.6 Plus

AlibabaQwenProprietary

Описание

Qwen3.6 Plus is Alibaba's next-generation flagship model featuring a 1 million token native context window, up to 65,536 output tokens, and always-on chain-of-thought reasoning. It uses a next-generation hybrid architecture optimized for efficiency and scalability. It leads on Terminal-Bench 2.0 agentic coding (61.6), surpassing Claude 4.5 Opus, and achieves strong results on document understanding (OmniDocBench 91.2) and multimodal reasoning (MMMU 86.0). Compared to Qwen 3.5, it is significantly more decisive in reasoning, using fewer tokens on straightforward tasks with better agent stability.

Дата выхода
2026-04-02
Параметры
Длина контекста
1.0M
Модальности
image, text, video

Радар способностей

45
general
43
coding
88
reasoning
59
scienceоцен.
60
agents
90
multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен#МестоОценкаИсточник
Agents & Tools44
58.0
LS
Code Ranking31
78.0
AA
General Ranking15
88.0
AA
Multimodal Ranking14
87.0
LS
Reasoning28
82.0
LS
Science47
73.0
AA

Оценки бенчмарков (LLM Stats)

Agents

WideSearch74.3%Сам.
MCP Atlas74.1%Сам.
TAU3-Bench70.7%Сам.
OSWorld-Verified62.5%Сам.
TIR-Bench61.6%Сам.
Terminal-Bench 2.061.6%Сам.
Claw-Eval58.7%Сам.
SWE-Bench Pro56.6%Сам.
MCP-Mark48.2%Сам.
SkillsBench45.7%Сам.
VITA-Bench44.3%Сам.
DeepPlanning41.5%Сам.
Toolathlon39.8%Сам.
NL2Repo37.9%Сам.

Biology

GPQA90.4%Сам.

Chemistry

SuperGPQA71.6%Сам.

Code

SWE-Bench Verified78.8%Сам.
SWE-bench Multilingual73.8%Сам.

Finance

MMLU-Pro88.5%Сам.
MMLU-ProX84.7%Сам.

General

MMLU-Redux94.5%Сам.
IFEval94.3%Сам.
C-Eval93.3%Сам.
Global PIQA89.8%Сам.
MMMLU89.5%Сам.
MAXIFE88.2%Сам.
LiveCodeBench v687.1%Сам.
MMMU86.0%Сам.
Include85.1%Сам.
MMStar83.3%Сам.
MMMU-Pro78.8%Сам.
IFBench74.2%Сам.
SimpleVQA0.67 / 100Сам.
LongBench v262.0%Сам.
NOVA-6357.9%Сам.

Grounding

RefCOCO-avg0.94 / 100Сам.
ScreenSpot Pro68.2%Сам.

Healthcare

VideoMMMU84.0%Сам.

Language

WMT24++84.3%Сам.

Long Context

MLVU86.7%Сам.
AA-LCR68.3%Сам.
MMLongBench-Doc0.62 / 100Сам.

Math

HMMT 202596.7%Сам.
AIME 202695.3%Сам.
HMMT2594.6%Сам.
We-Math89.0%Сам.
DynaMath88.0%Сам.
MathVision88.0%Сам.
HMMT Feb 2687.8%Сам.
IMO-AnswerBench83.8%Сам.
PolyMATH77.4%Сам.
Humanity's Last Exam28.8%Сам.

Multimodal

V*96.9%Сам.
AI2D94.4%Сам.
OmniDocBench 1.591.2%Сам.
Video-MME84.2%Сам.
CC-OCR83.4%Сам.
CharXiv-R81.5%Сам.

Reasoning

CountBench0.98 / 100Сам.
ERQA65.7%Сам.

Spatial Reasoning

RealWorldQA85.4%Сам.

Vision

ODinW51.8%Сам.

Индексы оценки AA

Intelligence Index
50.0
Coding Index
42.9
Tau2
1.0
Gpqa
0.9
Ifbench
0.8
Lcr
0.7
Terminalbench Hard
0.4
Scicode
0.4
Hle
0.3

Оценки категорий LLM Stats

Video
90
Biology
90
Language
90
Spatial Reasoning
80
Structured Output
80
Text-to-image
80
Vision
80
Chemistry
80
Finance
80
Frontend Development
80
General
80
Grounding
80
Healthcare
80
Instruction Following
80
Legal
80
Math
80
Multimodal
80
Physics
80
Reasoning
80
Code
70
Economics
70
Image To Text
70
Long Context
70
Search
70
Tool Calling
60
Agents
60
Coding
50

Цены

Цена ввода$0.5 / 1M tokens
Цена вывода$3 / 1M tokens
Смешанная цена (3:1)$1.125 / 1M tokens

Скорость

Токенов/сек52.7 tokens/s
Задержка первого токена1.69s
Время до первого ответа107.01s

Доступные провайдеры

(Внутренние единицы LS)
ПровайдерЦена вводаЦена вывода
Together500K3.0M

Внешние ссылки