Qwen3.6 27B (Reasoning)

AlibabaQwen오픈 웨이트Apache 2.0 · 상업적 사용 가능

설명

Qwen3.6-27B is a dense 27-billion-parameter multimodal model in the Qwen3.6 series, supporting both vision-language thinking and non-thinking modes in a single unified checkpoint. The 64-layer language model uses a hybrid layout of 16 repeats of (3 × Gated DeltaNet → FFN, 1 × Gated Attention → FFN) with hidden dim 5120 and FFN intermediate 17408 — Gated DeltaNet has 48/16 heads for V/QK (head dim 128) and Gated Attention has 24/4 heads for Q/KV (head dim 256). It supports a native 262,144-token context extensible to ~1,010,000 via YaRN and is trained with multi-token prediction. The release delivers flagship-level agentic coding, surpassing the previous-generation open-source flagship Qwen3.5-397B-A17B (397B total / 17B active) on every major coding benchmark including SWE-bench Verified (77.2), SWE-bench Pro (53.5), Terminal-Bench 2.0 (59.3), and SkillsBench (48.2), and reaches 87.8 on GPQA Diamond. Released as open weights under Apache 2.0; accessible via Qwen Studio with the Alibaba Cloud Model Studio API coming soon.

출시일

2026-04-22

파라미터

27.8B

컨텍스트 길이

262K

모달리티

audio, image, text, video

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	65	54.0	LS
코딩 랭킹	69	72.0	AA
종합 랭킹	55	74.0	AA
멀티모달 랭킹	18	86.0	LS
추론	32	81.0	LS
과학	79	64.0	AA

벤치마크 점수 (LLM Stats)

Agents

QwenWebBench

1487.00 / 2000자체 보고

GDPval-AA

1158.00 / 3000자체 보고

AndroidWorld

70.3%자체 보고

Claw-Eval

60.6%자체 보고

Terminal-Bench 2.0

59.3%자체 보고

SWE-Bench Pro

53.5%자체 보고

ZClawBench

53.4%자체 보고

SkillsBench

48.2%자체 보고

NL2Repo

36.2%자체 보고

Biology

GPQA

87.8%자체 보고

Chemistry

SuperGPQA

66.0%자체 보고

Code

SWE-Bench Verified

77.2%자체 보고

SWE-bench Multilingual

71.3%자체 보고

Embodied

EmbSpatialBench

0.85 / 100자체 보고

Finance

MMLU-Pro

86.2%자체 보고

General

MMLU-Redux

93.5%자체 보고

C-Eval

91.4%자체 보고

LiveCodeBench v6

83.9%자체 보고

MMMU

82.9%자체 보고

MMStar

81.4%자체 보고

MMMU-Pro

75.8%자체 보고

SimpleVQA

0.56 / 100자체 보고

Grounding

RefCOCO-avg

0.93 / 100자체 보고

RefSpatialBench

0.70 / 100자체 보고

Healthcare

VideoMMMU

84.4%자체 보고

Image To Text

OCRBench

89.4%자체 보고

Long Context

MLVU

86.6%자체 보고

Math

AIME 2026

94.1%자체 보고

HMMT 2025

93.8%자체 보고

HMMT25

90.7%자체 보고

MathVista-Mini

87.4%자체 보고

DynaMath

85.6%자체 보고

HMMT Feb 26

84.3%자체 보고

IMO-AnswerBench

80.8%자체 보고

Humanity's Last Exam

24.0%자체 보고

Multimodal

VLMsAreBlind

97.0%자체 보고

94.7%자체 보고

MMBench-V1.1

92.3%자체 보고

VideoMME w sub.

87.7%자체 보고

CC-OCR

81.2%자체 보고

CharXiv-R

78.4%자체 보고

MVBench

75.5%자체 보고

Reasoning

CountBench

0.98 / 100자체 보고

ERQA

62.5%자체 보고

Spatial Reasoning

RealWorldQA

84.1%자체 보고

AA 평가 지수

Coding Index

53.7

Intelligence Index

37.1

Tau2

0.9

Gpqa

0.8

Lcr

0.7

Ifbench

0.7

Terminalbench V2 1

0.6

Scicode

0.4

Terminalbench Hard

0.3

Hle

0.2

Tau Banking

0.2

LLM Stats 카테고리 점수

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Language

Long Context

Biology

Math

Multimodal

Physics

Spatial Reasoning

Structured Output

Embodied

Frontend Development

Grounding

Healthcare

Chemistry

Text-to-image

Video

Vision

Image To Text

Code

Economics

Tool Calling

Coding

가격

입력 가격$0.6 / 1M 토큰

출력 가격$3.6 / 1M 토큰

혼합 가격 (3:1)$1.35 / 1M 토큰

속도

토큰/초66.0

첫 토큰 지연1.33s

첫 응답 지연87.31s

공급자 가격 순위

9개 공급자

최저가: Novita최고가: routing.run

공급자입력출력

1Novita최저가

2Chutes

$0.195

$1.56

3NanoGPT

$0.203

$2.24

4OpenRouter

$0.2885

$3.17

5Kilo Gateway

$0.325

$3.25

6Venice AI

$0.325

$3.25

7Alibaba주요

$0.6

$3.6

8Vercel AI Gateway

$0.6

$3.6

9routing.run

$1.1

$3.3

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

LLM Stats Artificial Analysis