Qwen3.6 27B (Reasoning)

AlibabaQwen開源權重Apache 2.0 · 商用許可

描述

Qwen3.6-27B is a dense 27-billion-parameter multimodal model in the Qwen3.6 series, supporting both vision-language thinking and non-thinking modes in a single unified checkpoint. The 64-layer language model uses a hybrid layout of 16 repeats of (3 × Gated DeltaNet → FFN, 1 × Gated Attention → FFN) with hidden dim 5120 and FFN intermediate 17408 — Gated DeltaNet has 48/16 heads for V/QK (head dim 128) and Gated Attention has 24/4 heads for Q/KV (head dim 256). It supports a native 262,144-token context extensible to ~1,010,000 via YaRN and is trained with multi-token prediction. The release delivers flagship-level agentic coding, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active) on every major coding benchmark including SWE-bench Verified (77.2), SWE-bench Pro (53.5), Terminal-Bench 2.0 (59.3), and SkillsBench (48.2), and reaches 87.8 on GPQA Diamond. Released as open weights under Apache 2.0; accessible via Qwen Studio with the Alibaba Cloud Model Studio API coming soon.

發布日期

2026-04-22

參數規模

27.8B

上下文長度

262K

支援模態

audio, image, text, video

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
智慧體能力模型榜	65	54.0	LS
程式碼能力榜	69	72.0	AA
通用能力榜	55	74.0	AA
多模態榜	18	86.0	LS
推理能力	32	81.0	LS
科學能力	79	64.0	AA

基準測試分數 (LLM Stats)

Agents

QwenWebBench

1487.00 / 2000自報

GDPval-AA

1158.00 / 3000自報

AndroidWorld

70.3%自報

Claw-Eval

60.6%自報

Terminal-Bench 2.0

59.3%自報

SWE-Bench Pro

53.5%自報

ZClawBench

53.4%自報

SkillsBench

48.2%自報

NL2Repo

36.2%自報

Biology

GPQA

87.8%自報

Chemistry

SuperGPQA

66.0%自報

Code

SWE-Bench Verified

77.2%自報

SWE-bench Multilingual

71.3%自報

Embodied

EmbSpatialBench

0.85 / 100自報

Finance

MMLU-Pro

86.2%自報

General

MMLU-Redux

93.5%自報

C-Eval

91.4%自報

LiveCodeBench v6

83.9%自報

MMMU

82.9%自報

MMStar

81.4%自報

MMMU-Pro

75.8%自報

SimpleVQA

0.56 / 100自報

Grounding

RefCOCO-avg

0.93 / 100自報

RefSpatialBench

0.70 / 100自報

Healthcare

VideoMMMU

84.4%自報

Image To Text

OCRBench

89.4%自報

Long Context

MLVU

86.6%自報

Math

AIME 2026

94.1%自報

HMMT 2025

93.8%自報

HMMT25

90.7%自報

MathVista-Mini

87.4%自報

DynaMath

85.6%自報

HMMT Feb 26

84.3%自報

IMO-AnswerBench

80.8%自報

Humanity's Last Exam

24.0%自報

Multimodal

VLMsAreBlind

97.0%自報

94.7%自報

MMBench-V1.1

92.3%自報

VideoMME w sub.

87.7%自報

CC-OCR

81.2%自報

CharXiv-R

78.4%自報

MVBench

75.5%自報

Reasoning

CountBench

0.98 / 100自報

ERQA

62.5%自報

Spatial Reasoning

RealWorldQA

84.1%自報

AA 評測指數

Coding Index

53.7

Intelligence Index

37.1

Tau2

0.9

Gpqa

0.8

Lcr

0.7

Ifbench

0.7

Terminalbench V2 1

0.6

Scicode

0.4

Terminalbench Hard

0.3

Hle

0.2

Tau Banking

0.2

LLM Stats 分類評分

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Language

Long Context

Biology

Math

Multimodal

Physics

Spatial Reasoning

Structured Output

Embodied

Frontend Development

Grounding

Healthcare

Chemistry

Text-to-image

Video

Vision

Image To Text

Code

Economics

Tool Calling

Coding

定價

輸入價格$0.6 / 1M tokens

輸出價格$3.6 / 1M tokens

混合價格(3:1)$1.35 / 1M tokens

速度

Tokens/秒65.4

首Token延遲1.33s

首回答延遲88.14s

供應商價格排行

9 個供應商

最便宜: Novita最貴: routing.run

供應商輸入輸出

1Novita最便宜

2Chutes

$0.195

$1.56

3NanoGPT

$0.203

$2.24

4OpenRouter

$0.2885

$3.17

5Kilo Gateway

$0.325

$3.25

6Venice AI

$0.325

$3.25

7Alibaba主要

$0.6

$3.6

8Vercel AI Gateway

$0.6

$3.6

9routing.run

$1.1

$3.3

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis