Qwen3 235B A22B (Non-reasoning)
AlibabaQwenOpen WeightApache 2.0 · Commercial OK
Description
Qwen3 235B A22B is a large language model developed by Alibaba, featuring a Mixture-of-Experts (MoE) architecture with 235 billion total parameters and 22 billion activated parameters. It achieves competitive results in benchmark evaluations of coding, math, general capabilities, and more, compared to other top-tier models.
Release Date
2025-04-28
Parameters
235.0B
Context Length
131K
Modalities
text
Capability Radar
33
general
23
coding
40
reasoning
39
scienceest.
70
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 351 | 19.0 | AA |
| General Ranking | 286 | 38.0 | AA |
| Math Reasoning | 227 | 39.0 | AA |
| Reasoning | 32 | 79.0 | LS |
| Science | 275 | 40.0 | AA |
Benchmark Scores (LLM Stats)
Biology
GPQA
47.5%SR
Chemistry
SuperGPQA
44.1%SR
Code
EvalPlus
0.78 / 100SR
LiveCodeBench
70.7%SR
Aider
61.8%SR
Creativity
Arena Hard
95.6%SR
Finance
MMLU
87.8%SR
MMLU-Pro
68.2%SR
General
MMLU-Redux
87.4%SR
MMMLU
86.7%SR
MBPP
0.81 / 100SR
LiveBench
77.1%SR
Include
73.5%SR
MultiLF
71.9%SR
BFCL
70.8%SR
MultiPL-E
65.9%SR
Language
BBH
88.9%SR
Math
GSM8k
94.4%SR
AIME 2024
85.7%SR
MGSM
83.5%SR
AIME 2025
81.5%SR
MATH
71.8%SR
Reasoning
CRUX-O
0.79 / 100SR
AA Evaluation Indices
Math Index23.7
Intelligence Index17.0
Coding Index14.0
Math 5000.9
Mmlu Pro0.8
Gpqa0.6
Ifbench0.4
Livecodebench0.3
Aime0.3
Scicode0.3
Tau20.3
Aime 250.2
Terminalbench Hard0.1
Hle0.0
Lcr0.0
LLM Stats Category Scores
Writing100
Creativity100
Language80
Math80
Reasoning80
Tool Calling70
Code70
Finance70
General70
Healthcare70
Legal70
Biology50
Chemistry50
Physics50
Economics40
Pricing
Input Price$0.45 / 1M tokens
Output Price$1.8 / 1M tokens
Blended Price (3:1)$0.787 / 1M tokens
Speed
Tokens/sec64.1 tokens/s
Time to First Token1.24s
Time to Answer1.24s
Available Providers
(LS internal units)No provider data available