메인 콘텐츠로 건너뛰기

Qwen3.6 27B (Reasoning)

AlibabaQwenOpen WeightApache 2.0 · Commercial OK

설명

Qwen3.6-27B is a dense 27-billion-parameter multimodal model in the Qwen3.6 series, supporting both vision-language thinking and non-thinking modes in a single unified checkpoint. The 64-layer language model uses a hybrid layout of 16 repeats of (3 × Gated DeltaNet → FFN, 1 × Gated Attention → FFN) with hidden dim 5120 and FFN intermediate 17408 — Gated DeltaNet has 48/16 heads for V/QK (head dim 128) and Gated Attention has 24/4 heads for Q/KV (head dim 256). It supports a native 262,144-token context extensible to ~1,010,000 via YaRN and is trained with multi-token prediction. The release delivers flagship-level agentic coding, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active) on every major coding benchmark including SWE-bench Verified (77.2), SWE-bench Pro (53.5), Terminal-Bench 2.0 (59.3), and SkillsBench (48.2), and reaches 87.8 on GPQA Diamond. Released as open weights under Apache 2.0; accessible via Qwen Studio with the Alibaba Cloud Model Studio API coming soon.

출시일
2026-04-22
파라미터
27.8B
컨텍스트 길이
262K
모달리티
image, text, video

능력 레이더

41
general
37
coding
84
reasoning
56
science추정
60
agents
80
multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인#순위점수소스
Agents & Tools42
58.0
LS
Code Ranking65
68.0
AA
General Ranking40
80.0
AA
Multimodal Ranking16
86.0
LS
Reasoning30
81.0
LS
Science61
68.0
AA

벤치마크 점수 (LLM Stats)

Agents

QwenWebBench1487.00 / 2000자체 보고
AndroidWorld70.3%자체 보고
Claw-Eval60.6%자체 보고
Terminal-Bench 2.059.3%자체 보고
SWE-Bench Pro53.5%자체 보고
ZClawBench53.4%자체 보고
SkillsBench48.2%자체 보고
NL2Repo36.2%자체 보고

Biology

GPQA87.8%자체 보고

Chemistry

SuperGPQA66.0%자체 보고

Code

SWE-Bench Verified77.2%자체 보고
SWE-bench Multilingual71.3%자체 보고

Embodied

EmbSpatialBench0.85 / 100자체 보고

Finance

MMLU-Pro86.2%자체 보고

General

MMLU-Redux93.5%자체 보고
C-Eval91.4%자체 보고
LiveCodeBench v683.9%자체 보고
MMMU82.9%자체 보고
MMStar81.4%자체 보고
MMMU-Pro75.8%자체 보고
SimpleVQA0.56 / 100자체 보고

Grounding

RefCOCO-avg0.93 / 100자체 보고
RefSpatialBench0.70 / 100자체 보고

Healthcare

VideoMMMU84.4%자체 보고

Image To Text

OCRBench89.4%자체 보고

Long Context

MLVU86.6%자체 보고

Math

AIME 202694.1%자체 보고
HMMT 202593.8%자체 보고
HMMT2590.7%자체 보고
MathVista-Mini87.4%자체 보고
DynaMath85.6%자체 보고
HMMT Feb 2684.3%자체 보고
IMO-AnswerBench80.8%자체 보고
Humanity's Last Exam24.0%자체 보고

Multimodal

VLMsAreBlind97.0%자체 보고
V*94.7%자체 보고
MMBench-V1.192.3%자체 보고
VideoMME w sub.87.7%자체 보고
CC-OCR81.2%자체 보고
CharXiv-R78.4%자체 보고
MVBench75.5%자체 보고

Reasoning

CountBench0.98 / 100자체 보고
ERQA62.5%자체 보고

Spatial Reasoning

RealWorldQA84.1%자체 보고

AA 평가 지수

Intelligence Index
45.8
Coding Index
36.5
Tau2
0.9
Gpqa
0.8
Lcr
0.7
Ifbench
0.7
Scicode
0.4
Terminalbench Hard
0.3
Hle
0.2

LLM Stats 카테고리 점수

Biology
90
Language
90
Long Context
90
Spatial Reasoning
80
Structured Output
80
Text-to-image
80
Video
80
Vision
80
Chemistry
80
Embodied
80
Finance
80
Frontend Development
80
General
80
Grounding
80
Healthcare
80
Legal
80
Math
80
Multimodal
80
Physics
80
Reasoning
80
Code
70
Economics
70
Image To Text
70
Tool Calling
60
Agents
50
Coding
50

가격

입력 가격$0.6 / 1M tokens
출력 가격$3.6 / 1M tokens
혼합 가격 (3:1)$1.35 / 1M tokens

속도

토큰/초67.7 tokens/s
첫 토큰 지연1.45s
첫 응답 지연31.00s

사용 가능한 프로바이더

(LS 내부 단위)
프로바이더입력 가격출력 가격
Novita600K3.6M

외부 링크