Qwen3.5 4B (Non-reasoning)

Alibaba · Qwen · Open weights · Apache 2.0 · Commercial use permitted

Description

Qwen3.5-4B is a 4-billion-parameter vision-language model that uses a Gated DeltaNet hybrid architecture with a 3:1 ratio of linear attention to full softmax attention. It supports a native context length of 262K tokens and delivers strong performance for its size across knowledge, reasoning, coding, and multilingual tasks.
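The 3:1 ratio in the description can be pictured as a repeating layer schedule. The sketch below is illustrative only: the description gives the ratio but not the ordering or the layer count, so the placement of the full-attention layer at the end of each group of four, the function name `hybrid_layer_pattern`, and the 8-layer example are all assumptions.

```python
def hybrid_layer_pattern(num_layers, linear_per_full=3):
    """Return a list of layer kinds with `linear_per_full` linear-attention
    layers for every full softmax-attention layer.

    Assumption: the full-attention layer closes each group; the source
    states only the 3:1 ratio, not the actual ordering.
    """
    period = linear_per_full + 1
    return [
        "full_attention" if (i + 1) % period == 0 else "linear_attention"
        for i in range(num_layers)
    ]


# Hypothetical 8-layer stack, not the model's real depth.
layers = hybrid_layer_pattern(8)
for index, kind in enumerate(layers):
    print(index, kind)
```

Under this schedule, most layers scale linearly with sequence length, and the periodic full-attention layers keep exact token-to-token attention available.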

Release date

2026-03-02

Parameters

4.0B

Context length

—

Supported modalities

—

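Capability radar chart

[Radar chart; axes: general, coding, reasoning, science (estimated), agents, multimodal]

The science score is estimated from reasoning ability as a proxy when no dedicated science evaluation is available.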

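Leaderboard rankings

Category	Rank	Score	Source
Agent capability leaderboard	95	47.0	LS
Coding leaderboard	310	27.0	AA
General capability leaderboard	248	42.0	AA
Science	304	38.0	AA

Sources: LS = LLM Stats, AA = Artificial Analysis.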

Benchmark scores (LLM Stats)

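Category	Benchmark	Score	Source
Agents	t2-bench	79.9%	Self-reported
Agents	BFCL-V4	50.3%	Self-reported
Agents	VITA-Bench	22.0%	Self-reported
Agents	DeepPlanning	17.6%	Self-reported
Biology	GPQA	76.2%	Self-reported
Chemistry	SuperGPQA	52.9%	Self-reported
Communication	Multi-Challenge	49.0%	Self-reported
Finance	MMLU-Pro	79.1%	Self-reported
Finance	MMLU-ProX	71.5%	Self-reported
General	IFEval	89.8%	Self-reported
General	MMLU-Redux	88.8%	Self-reported
General	C-Eval	85.1%	Self-reported
General	Global PIQA	78.9%	Self-reported
General	MAXIFE	78.0%	Self-reported
General	MMMLU	76.1%	Self-reported
General	Include	71.0%	Self-reported
General	IFBench	59.2%	Self-reported
General	LiveCodeBench v6	55.8%	Self-reported
General	NOVA-63	54.3%	Self-reported
General	LongBench v2	50.0%	Self-reported
Language	WMT24++	66.6%	Self-reported
Long Context	AA-LCR	57.0%	Self-reported
Math	HMMT25	76.8%	Self-reported
Math	HMMT 2025	74.0%	Self-reported
Math	PolyMATH	51.1%	Self-reported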

Artificial Analysis (AA) evaluation indices

Metric	Value
Coding Index	20.3
Intelligence Index	16.0
Tau2	0.9
GPQA	0.7
IFBench	0.3
LCR	0.3
Terminalbench V2 1	0.2
SciCode	0.2
Terminalbench Hard	0.1
HLE	0.1
Tau Banking	0.0

LLM Stats category scores

[Category score chart; categories: Language, Biology, Instruction Following, Legal, Math, Physics, Structured Output, Finance, General, Healthcare, Tool Calling, Reasoning, Chemistry, Long Context, Multimodal, Spatial Reasoning, Communication, Economics, Vision, Agents]

Pricing

Input price: $0.03 / 1M tokens

Output price: $0.15 / 1M tokens

Blended price (3:1): $0.06 / 1M tokens
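The blended figure is consistent with a weighted average of the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token; the page does not spell this out, and the function name `blended_price` is only illustrative.

```python
def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices in USD per 1M tokens, taken from the listing above.
price = blended_price(0.03, 0.15)
print(f"${price:.2f} / 1M tokens")
```

Written out: (3 × $0.03 + 1 × $0.15) / 4 = $0.24 / 4 = $0.06 per 1M tokens, which matches the listed blended price.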

Speed

Tokens per second: 40.6

Time to first token: 0.43s

Time to first answer: 0.43s
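The throughput and first-token latency figures can be combined into a rough estimate of how long a response takes. The sketch below is a simplification: it assumes throughput stays constant at the listed rate after the first token, and the function name `estimated_response_seconds` is hypothetical.

```python
def estimated_response_seconds(output_tokens,
                               tokens_per_second=40.6,
                               time_to_first_token=0.43):
    """Rough wall-clock estimate: first-token latency plus generation time.

    Assumption: constant throughput; real latency varies with load,
    prompt length, and provider.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return time_to_first_token + output_tokens / tokens_per_second


# Example: a 500-token answer at the listed speed.
seconds = estimated_response_seconds(500)
print(f"{seconds:.1f} s")
```

For longer outputs, generation time dominates and the 0.43s first-token latency becomes a small share of the total.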

Provider price ranking

1 provider

Rank	Provider	Input	Output
1	Alibaba (primary)	$0.03	$0.15

Compare this model's pricing across API providers.

External links

LLM Stats, Artificial Analysis