Qwen3.6 27B (Reasoning)

AlibabaQwenOpen WeightApache 2.0 · Commercial OK

描述

Qwen3.6-27B is a dense 27-billion-parameter multimodal model in the Qwen3.6 series, supporting both vision-language thinking and non-thinking modes in a single unified checkpoint. The 64-layer language model uses a hybrid layout of 16 repeats of (3 × Gated DeltaNet → FFN, 1 × Gated Attention → FFN) with hidden dim 5120 and FFN intermediate 17408 — Gated DeltaNet has 48/16 heads for V/QK (head dim 128) and Gated Attention has 24/4 heads for Q/KV (head dim 256). It supports a native 262,144-token context extensible to ~1,010,000 via YaRN and is trained with multi-token prediction. The release delivers flagship-level agentic coding, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active) on every major coding benchmark including SWE-bench Verified (77.2), SWE-bench Pro (53.5), Terminal-Bench 2.0 (59.3), and SkillsBench (48.2), and reaches 87.8 on GPQA Diamond. Released as open weights under Apache 2.0; accessible via Qwen Studio with the Alibaba Cloud Model Studio API coming soon.

发布日期

2026-04-22

参数规模

27.8B

上下文长度

262K

支持模态

image, text, video

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
智能体与工具	42	58.0	LS
代码能力榜	65	68.0	AA
通用能力榜	40	80.0	AA
多模态榜	16	86.0	LS
推理能力	30	81.0	LS
科学能力	61	68.0	AA

基准测试分数 (LLM Stats)

Agents

QwenWebBench

1487.00 / 2000自报

AndroidWorld

70.3%自报

Claw-Eval

60.6%自报

Terminal-Bench 2.0

59.3%自报

SWE-Bench Pro

53.5%自报

ZClawBench

53.4%自报

SkillsBench

48.2%自报

NL2Repo

36.2%自报

Biology

GPQA

87.8%自报

Chemistry

SuperGPQA

66.0%自报

Code

SWE-Bench Verified

77.2%自报

SWE-bench Multilingual

71.3%自报

Embodied

EmbSpatialBench

0.85 / 100自报

Finance

MMLU-Pro

86.2%自报

General

MMLU-Redux

93.5%自报

C-Eval

91.4%自报

LiveCodeBench v6

83.9%自报

MMMU

82.9%自报

MMStar

81.4%自报

MMMU-Pro

75.8%自报

SimpleVQA

0.56 / 100自报

Grounding

RefCOCO-avg

0.93 / 100自报

RefSpatialBench

0.70 / 100自报

Healthcare

VideoMMMU

84.4%自报

Image To Text

OCRBench

89.4%自报

Long Context

MLVU

86.6%自报

Math

AIME 2026

94.1%自报

HMMT 2025

93.8%自报

HMMT25

90.7%自报

MathVista-Mini

87.4%自报

DynaMath

85.6%自报

HMMT Feb 26

84.3%自报

IMO-AnswerBench

80.8%自报

Humanity's Last Exam

24.0%自报

Multimodal

VLMsAreBlind

97.0%自报

94.7%自报

MMBench-V1.1

92.3%自报

VideoMME w sub.

87.7%自报

CC-OCR

81.2%自报

CharXiv-R

78.4%自报

MVBench

75.5%自报

Reasoning

CountBench

0.98 / 100自报

ERQA

62.5%自报

Spatial Reasoning

RealWorldQA

84.1%自报

AA 评测指数

Intelligence Index

45.8

Coding Index

36.5

Tau2

0.9

Gpqa

0.8

Lcr

0.7

Ifbench

0.7

Scicode

0.4

Terminalbench Hard

0.3

Hle

0.2

LLM Stats 分类评分

Biology

Language

Long Context

Spatial Reasoning

Structured Output

Text-to-image

Video

Vision

Chemistry

Embodied

Finance

Frontend Development

General

Grounding

Healthcare

Legal

Math

Multimodal

Physics

Reasoning

Code

Economics

Image To Text

Tool Calling

Agents

Coding

定价

输入价格$0.6 / 1M tokens

输出价格$3.6 / 1M tokens

混合价格(3:1)$1.35 / 1M tokens

速度

Tokens/秒67.7 tokens/s

首Token延迟1.45s

首回答延迟31.00s

可用提供商

(LS 内部计价单位)

提供商	输入价格	输出价格
Novita	600K	3.6M

外部链接

LLM Stats