
Qwen3.5 0.8B (Non-reasoning)

Alibaba · Qwen · Open Weight · Apache 2.0 · Commercial OK

Description

Qwen3.5-0.8B is a 0.8-billion-parameter vision-language model built on a Gated DeltaNet hybrid architecture with a 3:1 ratio of linear attention to full softmax attention. It supports a 262K native context length and offers both thinking and non-thinking modes.
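The stated 3:1 ratio can be illustrated with a short sketch. The repeating four-layer block layout below is an assumption for illustration; the card only states the overall ratio, not the actual interleaving pattern.

```python
# Sketch of a 3:1 hybrid attention layout (assumption: a repeating
# 4-layer block of three linear-attention layers plus one softmax layer).
def layer_types(num_layers: int, ratio: int = 3) -> list[str]:
    """Return the attention type for each layer index."""
    block = ["linear"] * ratio + ["softmax"]  # one block = ratio + 1 layers
    return [block[i % len(block)] for i in range(num_layers)]

# For a 12-layer stack this yields 9 linear and 3 softmax layers (3:1).
types = layer_types(12)
```

Any block size that preserves the 3:1 ratio would match the description equally well.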

Release Date: 2026-03-02
Parameters: 800M
Context Length: 41K
Modalities: text

Capability Radar

general: 9
coding: 1
reasoning: 24
science (est.): 13
agents: 20
multimodal: 0

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain           Rank   Score  Source
Agents & Tools   #100   17.0   LS
Code Ranking     #463   3.0    AA
General Ranking  #407   24.0   AA
Science          #474   10.0   AA

Benchmark Scores (LLM Stats)

Agents

BFCL-V4: 25.3% (SR)
τ²-bench: 11.6% (SR)

Biology

GPQA: 11.9% (SR)

Chemistry

SuperGPQA: 21.3% (SR)

Communication

Multi-Challenge: 18.9% (SR)

Finance

MMLU-Pro: 42.3% (SR)
MMLU-ProX: 34.6% (SR)

General

MMLU-Redux: 59.5% (SR)
Global PIQA: 59.4% (SR)
C-Eval: 50.5% (SR)
MMMLU: 44.3% (SR)
IFEval: 44.0% (SR)
NOVA-63: 42.4% (SR)
Include: 40.6% (SR)
MAXIFE: 39.2% (SR)
LongBench v2: 26.1% (SR)
IFBench: 21.0% (SR)

Language

WMT24++: 27.2% (SR)

Long Context

AA-LCR: 4.7% (SR)

Math

PolyMATH: 8.2% (SR)

AA Evaluation Indices

Intelligence Index: 9.9
Coding Index: 1.0
Tau2: 0.7
GPQA: 0.2
IFBench: 0.2
LCR: 0.1
HLE: 0.0
SciCode: 0.0
TerminalBench Hard: 0.0

LLM Stats Category Scores

Structured Output: 40
General: 40
Language: 40
Math: 40
Finance: 30
Healthcare: 30
Instruction Following: 30
Legal: 30
Physics: 30
Reasoning: 30
Tool Calling: 20
Agents: 20
Chemistry: 20
Communication: 20
Economics: 20
Long Context: 20
Spatial Reasoning: 10
Vision: 10
Biology: 10
Multimodal: 10

Pricing

Input Price: $0.01 / 1M tokens
Output Price: $0.05 / 1M tokens
Blended Price (3:1): $0.02 / 1M tokens
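The blended price is the weighted average of input and output prices at the stated 3:1 input-to-output token mix, which a quick check confirms:

```python
# Blended price = weighted average of input/output prices at a 3:1 token mix.
input_price, output_price = 0.01, 0.05  # $ per 1M tokens, from the table above

blended = (3 * input_price + 1 * output_price) / 4
print(f"${blended:.2f} / 1M tokens")  # → $0.02 / 1M tokens
```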

Speed

Throughput: 367.0 tokens/s
Time to First Token: 0.48s
Time to Answer: 0.48s
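Throughput and time-to-first-token combine into a rough end-to-end latency estimate. The formula below is a common approximation, not something the card states, and it assumes decoding proceeds at the steady-state rate:

```python
# Rough latency estimate: TTFT + output_tokens / throughput.
ttft = 0.48             # seconds, from the table above
tokens_per_sec = 367.0  # steady-state decode throughput

def time_to_answer(output_tokens: int) -> float:
    """Estimated wall-clock time to stream a full response."""
    return ttft + output_tokens / tokens_per_sec

# e.g. a 500-token answer takes roughly 1.84 s under these assumptions
print(f"{time_to_answer(500):.2f}s")
```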

Available Providers


No provider data available

External Sources