Qwen3 235B A22B 2507 (Reasoning)
AlibabaQwenOpen WeightApache 2.0 · Commercial OK
描述
Qwen3-235B-A22B-Thinking-2507 is a state-of-the-art thinking-enabled Mixture-of-Experts (MoE) model with 235B total parameters (22B activated). It features 94 layers, 128 experts (8 activated), and supports 262K native context length. This version delivers significantly improved reasoning performance, achieving state-of-the-art results among open-source thinking models on logical reasoning, mathematics, science, coding, and academic benchmarks. Key enhancements include markedly better general capabilities (instruction following, tool usage, text generation), enhanced 256K long-context understanding, and increased thinking depth. The model supports only thinking mode with automatic <think> tag inclusion.
發布日期
2025-07-25
參數規模
235.0B
上下文長度
262K
支援模態
text
能力雷達圖
44
general
45
coding
92
reasoning
53
science估算
60
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Agents
BFCL-v3
71.9%自報
Biology
GPQA
81.1%自報
Chemistry
SuperGPQA
64.9%自報
Code
CFEval
2134.00 / 10000自報
Communication
WritingBench
88.3%自報
Multi-IF
80.6%自報
Tau2 Retail
71.9%自報
TAU-bench Retail
67.8%自報
Tau2 Airline
58.0%自報
TAU-bench Airline
46.0%自報
Tau2 Telecom
45.6%自報
Creativity
Creative Writing v3
86.1%自報
Arena-Hard v2
79.7%自報
Finance
MMLU-Pro
84.4%自報
MMLU-ProX
81.0%自報
General
MMLU-Redux
93.8%自報
IFEval
87.8%自報
Include
81.0%自報
LiveBench 20241125
78.4%自報
LiveCodeBench v6
74.1%自報
Math
AIME 2025
92.3%自報
HMMT25
83.9%自報
PolyMATH
60.1%自報
Humanity's Last Exam
18.2%自報
Reasoning
OJBench
32.5%自報
AA 評測指數
Math Index91.0
Intelligence Index29.5
Coding Index23.2
Math 5001.0
Aime0.9
Aime 250.9
Mmlu Pro0.8
Gpqa0.8
Livecodebench0.8
Lcr0.7
Tau20.5
Ifbench0.5
Scicode0.4
Hle0.1
Terminalbench Hard0.1
LLM Stats 分類評分
Structured Output80
Writing80
Biology80
Creativity80
Finance80
General80
Healthcare80
Instruction Following80
Language80
Legal80
Agents70
Chemistry70
Communication70
Math70
Physics70
Reasoning70
Spatial Reasoning60
Tool Calling60
Economics60
Multimodal60
Vision40
定價
輸入價格$0.4 / 1M tokens
輸出價格$2.15 / 1M tokens
混合價格(3:1)$0.838 / 1M tokens
速度
Tokens/秒58.5 tokens/s
首Token延遲1.22s
首回答延遲35.39s
可用提供商
(LS 內部計價單位)| 提供商 | 輸入價格 | 輸出價格 |
|---|---|---|
| Fireworks | 300K | 3.0M |
| Novita | 300K | 3.0M |