Qwen3.6 27B (Reasoning)

Alibaba · Qwen · Open Weight · Apache 2.0 · Commercial OK

Description

Qwen3.6-27B is a dense 27-billion-parameter multimodal model in the Qwen3.6 series. A single unified checkpoint supports both thinking and non-thinking modes for vision-language tasks. The 64-layer language model uses a hybrid layout: 16 repeats of (3 × Gated DeltaNet → FFN, 1 × Gated Attention → FFN), with hidden dimension 5120 and FFN intermediate size 17408. Gated DeltaNet uses 48 heads for V and 16 for QK (head dim 128); Gated Attention uses 24 heads for Q and 4 for KV (head dim 256). The native context is 262,144 tokens, extensible to about 1,010,000 with YaRN, and the model is trained with multi-token prediction. The release delivers flagship-level agentic coding: it surpasses the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active) on every major coding benchmark, including SWE-bench Verified (77.2), SWE-bench Pro (53.5), Terminal-Bench 2.0 (59.3), and SkillsBench (48.2), and it reaches 87.8 on GPQA Diamond. The weights are released under Apache 2.0. The model is accessible via Qwen Studio, with the Alibaba Cloud Model Studio API coming soon.
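The layer arithmetic in the description can be checked with a short sketch. The variable names below are illustrative only and do not come from any Qwen configuration file; the per-head projection widths are simply heads × head dim, and the context ratio is derived from the two approximate figures quoted above rather than from a published YaRN setting.

```python
# Illustrative check of the numbers quoted in the description.
# All names here are hypothetical; they are not Qwen config keys.

REPEATS = 16
# One repeat: three Gated DeltaNet blocks followed by one Gated Attention block
# (each block is followed by an FFN, which does not add to the layer count here).
BLOCK = ["gated_deltanet"] * 3 + ["gated_attention"]

layers = BLOCK * REPEATS
n_layers = len(layers)

# Projection widths implied by the head counts and head dims.
deltanet_v_width = 48 * 128    # V heads
deltanet_qk_width = 16 * 128   # QK heads
attn_q_width = 24 * 256        # Q heads
attn_kv_width = 4 * 256        # KV heads

# Ratio between the extended and native context lengths as quoted.
NATIVE_CTX = 262_144
EXTENDED_CTX = 1_010_000       # the description gives this as approximate
context_ratio = EXTENDED_CTX / NATIVE_CTX

print(n_layers, layers.count("gated_deltanet"), layers.count("gated_attention"))
print(deltanet_v_width, deltanet_qk_width, attn_q_width, attn_kv_width)
print(round(context_ratio, 2))
```

The count comes out to 64 layers (48 Gated DeltaNet, 16 Gated Attention), matching the stated depth, and the quoted context figures imply an extension ratio of a little under 4.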

Release Date

2026-04-22

Parameters

27.8B

Context Length

262K

Modalities

audio, image, text, video

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	65	54.0	LS
Code Ranking	69	72.0	AA
General Ranking	55	74.0	AA
Multimodal Ranking	18	86.0	LS
Reasoning	32	81.0	LS
Science	79	64.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score	Source
Agents	QwenWebBench	1487.00 / 2000	SR
Agents	GDPval-AA	1158.00 / 3000	SR
Agents	AndroidWorld	70.3%	SR
Agents	Claw-Eval	60.6%	SR
Agents	Terminal-Bench 2.0	59.3%	SR
Agents	SWE-Bench Pro	53.5%	SR
Agents	ZClawBench	53.4%	SR
Agents	SkillsBench	48.2%	SR
Agents	NL2Repo	36.2%	SR
Biology	GPQA	87.8%	SR
Chemistry	SuperGPQA	66.0%	SR
Code	SWE-Bench Verified	77.2%	SR
Code	SWE-bench Multilingual	71.3%	SR
Embodied	EmbSpatialBench	0.85 / 100	SR
Finance	MMLU-Pro	86.2%	SR
General	MMLU-Redux	93.5%	SR
General	C-Eval	91.4%	SR
General	LiveCodeBench v6	83.9%	SR
General	MMMU	82.9%	SR
General	MMStar	81.4%	SR
General	MMMU-Pro	75.8%	SR
General	SimpleVQA	0.56 / 100	SR
Grounding	RefCOCO-avg	0.93 / 100	SR
Grounding	RefSpatialBench	0.70 / 100	SR
Healthcare	VideoMMMU	84.4%	SR
Image To Text	OCRBench	89.4%	SR
Long Context	MLVU	86.6%	SR
Math	AIME 2026	94.1%	SR
Math	HMMT 2025	93.8%	SR
Math	HMMT25	90.7%	SR
Math	MathVista-Mini	87.4%	SR
Math	DynaMath	85.6%	SR
Math	HMMT Feb 26	84.3%	SR
Math	IMO-AnswerBench	80.8%	SR
Math	Humanity's Last Exam	24.0%	SR
Multimodal	VLMsAreBlind	97.0%	SR
Multimodal	(not named)	94.7%	SR
Multimodal	MMBench-V1.1	92.3%	SR
Multimodal	VideoMME w sub.	87.7%	SR
Multimodal	CC-OCR	81.2%	SR
Multimodal	CharXiv-R	78.4%	SR
Multimodal	MVBench	75.5%	SR
Reasoning	CountBench	0.98 / 100	SR
Reasoning	ERQA	62.5%	SR
Spatial Reasoning	RealWorldQA	84.1%	SR

AA Evaluation Indices

Index	Value
Coding Index	53.7
Intelligence Index	37.1
Tau2	0.9
GPQA	0.8
LCR	0.7
IFBench	0.7
Terminalbench V2 1	0.6
SciCode	0.4
Terminalbench Hard	0.3
HLE	0.2
Tau Banking	0.2

LLM Stats Category Scores

Category	Score
Legal	100
Finance	100
Agents	100
General	100

Categories listed without a score: Reasoning, Language, Long Context, Biology, Math, Multimodal, Physics, Spatial Reasoning, Structured Output, Embodied, Frontend Development, Grounding, Healthcare, Chemistry, Text-to-image, Video, Vision, Image To Text, Code, Economics, Tool Calling, Coding

Pricing

Input Price: $0.60 / 1M tokens

Output Price: $3.60 / 1M tokens

Blended Price (3:1): $1.35 / 1M tokens
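The blended figure is consistent with a weighted average of the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token, which is an interpretation rather than something the page states; the function name is hypothetical.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for a given input:output mix.

    Assumes the ratio counts tokens, e.g. 3 input tokens per 1 output token.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Listed prices in USD per 1M tokens.
listed_blend = blended_price(0.6, 3.6)
print(round(listed_blend, 2))
```

Under that assumption the result is (3 × 0.6 + 1 × 3.6) / 4 = 1.35, which matches the listed blended price.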

Speed

Tokens/sec: 65.4

Time to First Token: 1.33s

Time to Answer: 88.14s

Provider Price Ranking

9 providers

Cheapest: Novita · Most Expensive: routing.run

Rank	Provider	Input	Output	Note
1	Novita	(not shown)	(not shown)	Cheapest
2	Chutes	$0.195	$1.56	
3	NanoGPT	$0.203	$2.24	
4	OpenRouter	$0.2885	$3.17	
5	Kilo Gateway	$0.325	$3.25	
6	Venice AI	$0.325	$3.25	
7	Alibaba	$0.6	$3.6	Primary
8	Vercel AI Gateway	$0.6	$3.6	
9	routing.run	$1.1	$3.3	

Compare pricing across different API providers for this model.
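Because input and output prices differ by provider, the ordering can shift with the token mix of a workload. The sketch below ranks the providers whose prices are listed on this page by a weighted average price; it assumes the prices are USD per 1M tokens (as in the Pricing section), omits Novita because its prices are not shown, and uses hypothetical function names and example mixes.

```python
# (input, output) prices as listed on this page, assumed USD per 1M tokens.
# Novita is omitted because its prices are not shown.
PROVIDERS = {
    "Chutes": (0.195, 1.56),
    "NanoGPT": (0.203, 2.24),
    "OpenRouter": (0.2885, 3.17),
    "Kilo Gateway": (0.325, 3.25),
    "Venice AI": (0.325, 3.25),
    "Alibaba": (0.6, 3.6),
    "Vercel AI Gateway": (0.6, 3.6),
    "routing.run": (1.1, 3.3),
}


def blended(prices, input_parts, output_parts):
    """Weighted average price for an input:output token mix."""
    input_price, output_price = prices
    total = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total


def rank(input_parts, output_parts):
    """Provider names sorted from cheapest to most expensive for the mix."""
    return sorted(
        PROVIDERS,
        key=lambda name: blended(PROVIDERS[name], input_parts, output_parts),
    )


input_heavy = rank(3, 1)    # three input tokens per output token
output_heavy = rank(1, 3)   # hypothetical output-heavy mix

for name in output_heavy:
    print(name, round(blended(PROVIDERS[name], 1, 3), 3))
```

With a 3:1 input-heavy mix routing.run is the most expensive listed provider, but with a 1:3 output-heavy mix its lower output price puts it ahead of Alibaba and Vercel AI Gateway. For a reasoning model that emits many output tokens, the output price may matter more than the input price.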

External Sources

LLM Stats · Artificial Analysis