Qwen3 Next 80B A3B (Reasoning)

AlibabaQwenオープンウエイトApache 2.0 · 商用利用可

説明

Qwen3-Next-80B-A3B-Thinking is the thinking variant of the Qwen3-Next series, featuring the same groundbreaking architecture as the instruct model. Leveraging GSPO, it addresses stability and efficiency challenges of hybrid attention + high-sparsity MoE in RL training. It uses Hybrid Attention combining Gated DeltaNet and Gated Attention for efficient ultra-long context modeling, High-Sparsity MoE with 512 experts (10 activated + 1 shared), and Multi-Token Prediction. With 80B total parameters and only 3B activated, it demonstrates outstanding performance on complex reasoning tasks — outperforming Qwen3-30B-A3B-Thinking-2507, Qwen3-32B-Thinking, and even the proprietary Gemini-2.5-Flash-Thinking across multiple benchmarks. Architecture: 48 layers, 15T training tokens, hybrid layout of 12*(3*(Gated DeltaNet->MoE)->(Gated Attention->MoE)). Supports only thinking mode with automatic <think> tag inclusion, may generate longer thinking content.

リリース日

2025-09-11

パラメータ

80.0B

コンテキスト長

131K

モダリティ

text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

BFCL-v3

72.0%自己申告

Biology

GPQA

77.2%自己申告

Chemistry

SuperGPQA

60.8%自己申告

Code

CFEval

2071.00 / 10000自己申告

Communication

WritingBench

84.6%自己申告

Multi-IF

77.8%自己申告

TAU-bench Retail

69.6%自己申告

Tau2 Retail

67.8%自己申告

Tau2 Airline

60.5%自己申告

TAU-bench Airline

49.0%自己申告

Tau2 Telecom

43.9%自己申告

Creativity

Arena-Hard v2

62.3%自己申告

Finance

MMLU-Pro

82.7%自己申告

MMLU-ProX

78.7%自己申告

General

MMLU-Redux

92.5%自己申告

IFEval

88.9%自己申告

Include

78.9%自己申告

LiveBench 20241125

76.6%自己申告

LiveCodeBench v6

68.7%自己申告

Math

AIME 2025

87.8%自己申告

HMMT25

73.9%自己申告

PolyMATH

56.3%自己申告

Reasoning

OJBench

29.7%自己申告

AA評価指数

Math Index

84.3

Intelligence Index

19.8

Aime 25

0.8

Mmlu Pro

0.8

Livecodebench

0.8

Gpqa

0.8

Ifbench

0.6

Lcr

0.6

Tau2

0.4

Scicode

0.4

Hle

0.1

Terminalbench Hard

0.1

LLM Statsカテゴリスコア

Instruction Following

Language

Legal

Math

Structured Output

Finance

General

Biology

Physics

Reasoning

Healthcare

Agents

Chemistry

Creativity

Writing

Multimodal

Spatial Reasoning

Communication

Economics

Tool Calling

Vision

価格設定

入力価格$0.5 / 1Mトークン

出力価格$6 / 1Mトークン

混合価格（3:1）$1.875 / 1Mトークン

速度

トークン/秒189.3

初トークン遅延1.10s

初回答遅延11.67s

プロバイダー価格ランキング

12 プロバイダー

最安: IO.NET最高: LLM Gateway

プロバイダー入力出力

1IO.NET最安

$0.1

$0.8

2Chutes

$0.1

$0.8

3SiliconFlow (China)

$0.14

$1.4

4Amazon Bedrock

$0.14

$1.4

5SiliconFlow

$0.14

$1.4

6Alibaba (China)

$0.144

$1.434

7Nebius Token Factory

$0.15

$1.2

8Cortecs

$0.164

$1.311

9Hugging Face

$0.25

10Venice AI

$0.35

$1.9

11Alibabaプライマリ

$0.5

12LLM Gateway

$0.5

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	10	72.0	LS
コーディングランキング	126	60.0	AA
総合ランキング	177	53.0	AA
数学的推論	65	85.0	AA
推論	105	30.0	LS
科学	152	54.0	AA