Перейти к основному содержанию

Qwen3.6 27B (Reasoning)

AlibabaQwenOpen WeightApache 2.0 · Commercial OK

Описание

Qwen3.6-27B is a dense 27-billion-parameter multimodal model in the Qwen3.6 series, supporting both vision-language thinking and non-thinking modes in a single unified checkpoint. The 64-layer language model uses a hybrid layout of 16 repeats of (3 × Gated DeltaNet → FFN, 1 × Gated Attention → FFN) with hidden dim 5120 and FFN intermediate 17408 — Gated DeltaNet has 48/16 heads for V/QK (head dim 128) and Gated Attention has 24/4 heads for Q/KV (head dim 256). It supports a native 262,144-token context extensible to ~1,010,000 via YaRN and is trained with multi-token prediction. The release delivers flagship-level agentic coding, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active) on every major coding benchmark including SWE-bench Verified (77.2), SWE-bench Pro (53.5), Terminal-Bench 2.0 (59.3), and SkillsBench (48.2), and reaches 87.8 on GPQA Diamond. Released as open weights under Apache 2.0; accessible via Qwen Studio with the Alibaba Cloud Model Studio API coming soon.

Дата выхода
2026-04-22
Параметры
27.8B
Длина контекста
262K
Модальности
image, text, video

Радар способностей

41
general
37
coding
84
reasoning
56
scienceоцен.
60
agents
80
multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен#МестоОценкаИсточник
Agents & Tools42
58.0
LS
Code Ranking65
68.0
AA
General Ranking40
80.0
AA
Multimodal Ranking16
86.0
LS
Reasoning30
81.0
LS
Science61
68.0
AA

Оценки бенчмарков (LLM Stats)

Agents

QwenWebBench1487.00 / 2000Сам.
AndroidWorld70.3%Сам.
Claw-Eval60.6%Сам.
Terminal-Bench 2.059.3%Сам.
SWE-Bench Pro53.5%Сам.
ZClawBench53.4%Сам.
SkillsBench48.2%Сам.
NL2Repo36.2%Сам.

Biology

GPQA87.8%Сам.

Chemistry

SuperGPQA66.0%Сам.

Code

SWE-Bench Verified77.2%Сам.
SWE-bench Multilingual71.3%Сам.

Embodied

EmbSpatialBench0.85 / 100Сам.

Finance

MMLU-Pro86.2%Сам.

General

MMLU-Redux93.5%Сам.
C-Eval91.4%Сам.
LiveCodeBench v683.9%Сам.
MMMU82.9%Сам.
MMStar81.4%Сам.
MMMU-Pro75.8%Сам.
SimpleVQA0.56 / 100Сам.

Grounding

RefCOCO-avg0.93 / 100Сам.
RefSpatialBench0.70 / 100Сам.

Healthcare

VideoMMMU84.4%Сам.

Image To Text

OCRBench89.4%Сам.

Long Context

MLVU86.6%Сам.

Math

AIME 202694.1%Сам.
HMMT 202593.8%Сам.
HMMT2590.7%Сам.
MathVista-Mini87.4%Сам.
DynaMath85.6%Сам.
HMMT Feb 2684.3%Сам.
IMO-AnswerBench80.8%Сам.
Humanity's Last Exam24.0%Сам.

Multimodal

VLMsAreBlind97.0%Сам.
V*94.7%Сам.
MMBench-V1.192.3%Сам.
VideoMME w sub.87.7%Сам.
CC-OCR81.2%Сам.
CharXiv-R78.4%Сам.
MVBench75.5%Сам.

Reasoning

CountBench0.98 / 100Сам.
ERQA62.5%Сам.

Spatial Reasoning

RealWorldQA84.1%Сам.

Индексы оценки AA

Intelligence Index
45.8
Coding Index
36.5
Tau2
0.9
Gpqa
0.8
Lcr
0.7
Ifbench
0.7
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.2

Оценки категорий LLM Stats

Biology
90
Language
90
Long Context
90
Spatial Reasoning
80
Structured Output
80
Text-to-image
80
Video
80
Vision
80
Chemistry
80
Embodied
80
Finance
80
Frontend Development
80
General
80
Grounding
80
Healthcare
80
Legal
80
Math
80
Multimodal
80
Physics
80
Reasoning
80
Code
70
Economics
70
Image To Text
70
Tool Calling
60
Agents
50
Coding
50

Цены

Цена ввода$0.6 / 1M tokens
Цена вывода$3.6 / 1M tokens
Смешанная цена (3:1)$1.35 / 1M tokens

Скорость

Токенов/сек67.7 tokens/s
Задержка первого токена1.45s
Время до первого ответа31.00s

Доступные провайдеры

(Внутренние единицы LS)
ПровайдерЦена вводаЦена вывода
Novita600K3.6M

Внешние ссылки