Qwen3.5 0.8B (Non-reasoning)
AlibabaQwenOpen WeightApache 2.0 · Commercial OK
Description
Qwen3.5-0.8B is a 0.8 billion parameter vision-language model using Gated DeltaNet hybrid architecture with a 3:1 ratio of linear attention to full softmax attention. It supports 262K native context length and features both thinking and non-thinking modes.
Release Date
2026-03-02
Parameters
800M
Context Length
41K
Modalities
text
Capability Radar
9
general
1
coding
24
reasoning
13
scienceest.
20
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 100 | 17.0 | LS |
| Code Ranking | 463 | 3.0 | AA |
| General Ranking | 407 | 24.0 | AA |
| Science | 474 | 10.0 | AA |
Benchmark Scores (LLM Stats)
Agents
BFCL-V4
25.3%SR
t2-bench
11.6%SR
Biology
GPQA
11.9%SR
Chemistry
SuperGPQA
21.3%SR
Communication
Multi-Challenge
18.9%SR
Finance
MMLU-Pro
42.3%SR
MMLU-ProX
34.6%SR
General
MMLU-Redux
59.5%SR
Global PIQA
59.4%SR
C-Eval
50.5%SR
MMMLU
44.3%SR
IFEval
44.0%SR
NOVA-63
42.4%SR
Include
40.6%SR
MAXIFE
39.2%SR
LongBench v2
26.1%SR
IFBench
21.0%SR
Language
WMT24++
27.2%SR
Long Context
AA-LCR
4.7%SR
Math
PolyMATH
8.2%SR
AA Evaluation Indices
Intelligence Index9.9
Coding Index1.0
Tau20.7
Gpqa0.2
Ifbench0.2
Lcr0.1
Hle0.0
Scicode0.0
Terminalbench Hard0.0
LLM Stats Category Scores
Structured Output40
General40
Language40
Math40
Finance30
Healthcare30
Instruction Following30
Legal30
Physics30
Reasoning30
Tool Calling20
Agents20
Chemistry20
Communication20
Economics20
Long Context20
Spatial Reasoning10
Vision10
Biology10
Multimodal10
Pricing
Input Price$0.01 / 1M tokens
Output Price$0.05 / 1M tokens
Blended Price (3:1)$0.02 / 1M tokens
Speed
Tokens/sec367.0 tokens/s
Time to First Token0.48s
Time to Answer0.48s
Available Providers
(LS internal units)No provider data available