Qwen3.6 Plus

Alibaba · Qwen · Proprietary

Description

Qwen3.6 Plus is Alibaba's next-generation flagship model, with a native context window of 1 million tokens, up to 65,536 output tokens, and always-on chain-of-thought reasoning. It uses a hybrid architecture optimized for efficiency and scalability. It leads on Terminal-Bench 2.0 agentic coding (61.6), surpassing Claude Opus 4.5, and achieves strong results on document understanding (OmniDocBench 1.5: 91.2) and multimodal reasoning (MMMU: 86.0). Compared with Qwen3.5, it is more decisive in reasoning: it uses fewer tokens on straightforward tasks and is more stable in agent workflows.

Release date

2026-04-02

Parameters

—

Context length

1.0M

Modalities

image, text, video

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When no dedicated science benchmark is available, the Science score is estimated using reasoning as a proxy.

Benchmark scores (LLM Stats)

Agents

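GDPval-AA: 1160.00 / 3000 (self-reported)
WideSearch: 74.3% (self-reported)
MCP Atlas: 74.1% (self-reported)
TAU3-Bench: 70.7% (self-reported)
OSWorld-Verified: 62.5% (self-reported)
TIR-Bench: 61.6% (self-reported)
Terminal-Bench 2.0: 61.6% (self-reported)
Claw-Eval: 58.7% (self-reported)
SWE-Bench Pro: 56.6% (self-reported)
MCP-Mark: 48.2% (self-reported)
SkillsBench: 45.7% (self-reported)
VITA-Bench: 44.3% (self-reported)
DeepPlanning: 41.5% (self-reported)
Finance Agent v2: 40.8% (self-reported)
Toolathlon: 39.8% (self-reported)
NL2Repo: 37.9% (self-reported)
FrontierSWE: 22.0% (self-reported)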

Biology

GPQA: 90.4% (self-reported)

Chemistry

SuperGPQA: 71.6% (self-reported)

Code

SWE-Bench Verified: 78.8% (self-reported)
SWE-bench Multilingual: 73.8% (self-reported)

Finance

MMLU-Pro: 88.5% (self-reported)
MMLU-ProX: 84.7% (self-reported)

General

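MMLU-Redux: 94.5% (self-reported)
IFEval: 94.3% (self-reported)
C-Eval: 93.3% (self-reported)
Global PIQA: 89.8% (self-reported)
MMMLU: 89.5% (self-reported)
MAXIFE: 88.2% (self-reported)
LiveCodeBench v6: 87.1% (self-reported)
MMMU: 86.0% (self-reported)
Include: 85.1% (self-reported)
MMStar: 83.3% (self-reported)
MMMU-Pro: 78.8% (self-reported)
IFBench: 74.2% (self-reported)
LiveBench: 70.9% (self-reported)
SimpleVQA: 0.67 / 100 (self-reported)
LongBench v2: 62.0% (self-reported)
NOVA-63: 57.9% (self-reported)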

Grounding

RefCOCO-avg: 0.94 / 100 (self-reported)
ScreenSpot Pro: 68.2% (self-reported)

Healthcare

VideoMMMU: 84.0% (self-reported)

Language

WMT24++: 84.3% (self-reported)

Long Context

MLVU: 86.7% (self-reported)
AA-LCR: 68.3% (self-reported)
MMLongBench-Doc: 0.62 / 100 (self-reported)

Math

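HMMT 2025: 96.7% (self-reported)
AIME 2026: 95.3% (self-reported)
HMMT25: 94.6% (self-reported)
We-Math: 89.0% (self-reported)
DynaMath: 88.0% (self-reported)
MathVision: 88.0% (self-reported)
HMMT Feb 26: 87.8% (self-reported)
IMO-AnswerBench: 83.8% (self-reported)
PolyMATH: 77.4% (self-reported)
Humanity's Last Exam: 28.8% (self-reported)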

Multimodal

96.9% (self-reported)
AI2D: 94.4% (self-reported)
OmniDocBench 1.5: 91.2% (self-reported)
Video-MME: 84.2% (self-reported)
CC-OCR: 83.4% (self-reported)
CharXiv-R: 81.5% (self-reported)

Reasoning

CountBench: 0.98 / 100 (self-reported)
ERQA: 65.7% (self-reported)

Spatial Reasoning

RealWorldQA: 85.4% (self-reported)

Vision

ODinW: 51.8% (self-reported)

AA evaluation indices

Coding Index: 54.5
Intelligence Index: 39.6
Tau2: 1.0
GPQA: 0.9
IFBench: 0.8
LCR: 0.7
Terminalbench V2 1: 0.6
Terminalbench Hard: 0.4
SciCode: 0.4
HLE: 0.3
Tau Banking: 0.2

LLM Stats category scores

Legal: 100
Finance: 100

Categories shown without a score: Agents, General, Reasoning, Language, Biology, Video, Math, Multimodal, Physics, Spatial Reasoning, Structured Output, Instruction Following, Frontend Development, Grounding, Healthcare, Chemistry, Text-to-image, Vision, Image To Text, Long Context, Economics, Code, Tool Calling, Coding

Pricing

Input price: $0.5 / 1M tokens
Output price: $3 / 1M tokens
Blended price (3:1 input to output): $1.125 / 1M tokens
Cache read price: $0.05 / 1M tokens
Cache write price: $0.625 / 1M tokens
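
The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces that arithmetic and also estimates the cost of a single request. The function names `blended_price` and `request_cost` are illustrative and not part of any provider API, and cache pricing is ignored.

```python
# Prices per 1M tokens, taken from the pricing list above.
INPUT_PRICE = 0.5
OUTPUT_PRICE = 3.0


def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output ratio."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = INPUT_PRICE,
                 output_price: float = OUTPUT_PRICE) -> float:
    """Cost in dollars of one request, ignoring cache reads and writes."""
    return (input_tokens / 1_000_000) * input_price \
        + (output_tokens / 1_000_000) * output_price


print(blended_price(INPUT_PRICE, OUTPUT_PRICE))  # 1.125
print(round(request_cost(300_000, 100_000), 4))  # 0.45
```

Because the model always reasons before answering, output token counts will usually include reasoning tokens, so real workloads may be more output-heavy than the 3:1 ratio assumes.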

Speed

Tokens per second: 52.6
Time to first token: 1.50s
Time to first answer: 107.00s
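
These figures can be combined into a rough end-to-end estimate. The sketch below assumes that the first-answer latency already covers the reasoning phase and that answer tokens then stream at the listed throughput. Both are simplifying assumptions rather than documented behavior, and `estimate_response_seconds` is a hypothetical helper.

```python
# Speed figures from the listing above.
TOKENS_PER_SECOND = 52.6
FIRST_ANSWER_LATENCY_S = 107.0


def estimate_response_seconds(answer_tokens: int,
                              first_answer_latency_s: float = FIRST_ANSWER_LATENCY_S,
                              tokens_per_second: float = TOKENS_PER_SECOND) -> float:
    """Rough wall-clock time until the last answer token arrives.

    Assumes constant throughput after the first answer token appears.
    """
    return first_answer_latency_s + answer_tokens / tokens_per_second


# Estimated seconds for a 1,000-token answer under these assumptions.
print(round(estimate_response_seconds(1000), 1))
```

Under these assumptions the reasoning phase dominates: generating a 1,000-token answer adds about 19 seconds to the 107 seconds spent before the first answer token.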

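Provider price ranking

16 providers

Cheapest: Together. Most expensive: Venice AI.

Rank	Provider	Input	Output
1	Together (cheapest)	not listed	not listed
2	AIHubMix	$0.28	$1.69
3	OpenRouter	$0.325	$1.95
4	Kilo Gateway	$0.325	$1.95
5	NanoGPT	$0.45	$2.7
6	Alibaba (primary)	$0.5	not listed
7	OpenCode Go	$0.5	not listed
8	Alibaba (China)	$0.5	not listed
9	ZenMux	$0.5	not listed
10	FrogBot	$0.5	not listed
11	Vercel AI Gateway	$0.5	not listed
12	LLM Gateway	$0.5	not listed
13	Together AI	$0.5	not listed
14	Auriko	$0.5	not listed
15	OrcaRouter	$0.5	not listed
16	Venice AI	$0.625	$3.75

Compare this model's pricing across API providers.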
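
As one way to compare providers on a single number, the sketch below ranks the providers for which both an input and an output price appear in the listing, using a 3:1 input-to-output blend. The 3:1 weighting mirrors the blended price quoted in the pricing section and is an assumption about the workload, and `blended` is an illustrative helper. Providers with a missing price are left out rather than guessed.

```python
# (input, output) prices per 1M tokens, only for providers where both are listed.
PROVIDER_PRICES = {
    "AIHubMix": (0.28, 1.69),
    "OpenRouter": (0.325, 1.95),
    "Kilo Gateway": (0.325, 1.95),
    "NanoGPT": (0.45, 2.7),
    "Venice AI": (0.625, 3.75),
}


def blended(input_price: float, output_price: float,
            input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for the given token ratio."""
    return (input_weight * input_price + output_weight * output_price) / (
        input_weight + output_weight
    )


# sorted() is stable, so providers with equal blended prices keep listing order.
ranking = sorted(PROVIDER_PRICES.items(), key=lambda item: blended(*item[1]))

for name, (inp, out) in ranking:
    print(f"{name}: ${blended(inp, out):.5f} per 1M blended tokens")
```

Changing `input_weight` and `output_weight` shows how the ordering responds to a more output-heavy workload, which is plausible for a model that always emits reasoning tokens.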

External links

LLM Stats, Artificial Analysis

Domain	Rank	Score	Source
Agent capability	68	54.0	LS
Coding ranking	52	76.0	AA
Overall ranking	23	80.0	AA
Multimodal ranking	17	87.0	LS
Reasoning	29	82.0	LS
Science	62	69.0	AA