Qwen3.5 27B (Reasoning)
Alibaba · Qwen · Open Weight · Apache 2.0 · Commercial OK
Description
Qwen3.5-27B is a multimodal dense foundation model with 27 billion parameters. It combines strong reasoning, coding, multilingual, long-context, and visual understanding performance in a production-friendly open-weight package with a native 262K context window.
Release date: 2026-02-24
Parameters: 27.0B
Context length: 262K
Modalities: image, text, video
Capability radar
general: 38
coding: 36
reasoning: 86
science (est.): 57
agents: 60
multimodal: 80
Science uses a reasoning-based proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 51 | 57.0 | LS |
| Code Ranking | 84 | 65.0 | AA |
| General Ranking | 43 | 80.0 | AA |
| Multimodal Ranking | 60 | 70.0 | LS |
| Reasoning | 54 | 67.0 | LS |
| Science | 63 | 68.0 | AA |
Benchmark scores (LLM Stats)
3D
SUNRGBD: 0.35 / 100 (self-reported)
Hypersim: 0.13 / 100 (self-reported)
Agents
t2-bench: 79.0% (self-reported)
BFCL-V4: 68.5% (self-reported)
AndroidWorld_SR: 64.2% (self-reported)
WideSearch: 61.1% (self-reported)
BrowseComp: 61.0% (self-reported)
FullStackBench en: 60.1% (self-reported)
TIR-Bench: 59.8% (self-reported)
FullStackBench zh: 57.4% (self-reported)
OSWorld-Verified: 56.2% (self-reported)
VITA-Bench: 41.9% (self-reported)
Terminal-Bench 2.0: 41.6% (self-reported)
DeepPlanning: 22.6% (self-reported)
Biology
GPQA: 85.5% (self-reported)
Chemistry
SuperGPQA: 65.6% (self-reported)
Code
SWE-Bench Verified: 72.4% (self-reported)
Communication
Multi-Challenge: 60.8% (self-reported)
Embodied
EmbSpatialBench: 0.84 / 100 (self-reported)
Finance
MMLU-Pro: 86.1% (self-reported)
MMLU-ProX: 82.2% (self-reported)
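General
IFEval: 95.0% (self-reported)
MMLU-Redux: 93.2% (self-reported)
C-Eval: 90.5% (self-reported)
MAXIFE: 88.0% (self-reported)
Global PIQA: 87.5% (self-reported)
MMMLU: 85.9% (self-reported)
MMMU: 82.3% (self-reported)
Include: 81.6% (self-reported)
MMStar: 81.0% (self-reported)
LiveCodeBench v6: 80.7% (self-reported)
IFBench: 76.5% (self-reported)
MMMU-Pro: 75.0% (self-reported)
LongBench v2: 60.6% (self-reported)
NOVA-63: 58.1% (self-reported)
SimpleVQA: 0.56 / 100 (self-reported)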
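Grounding
RefCOCO-avg: 0.91 / 100 (self-reported)
ScreenSpot Pro: 70.3% (self-reported)
RefSpatialBench: 0.68 / 100 (self-reported)
Healthcare
VideoMMMU: 82.3% (self-reported)
SlakeVQA: 80.0% (self-reported)
MedXpertQA: 62.4% (self-reported)
PMC-VQA: 62.4% (self-reported)
Image To Text
OCRBench: 89.4% (self-reported)
Language
LingoQA: 82.0% (self-reported)
WMT24++: 77.6% (self-reported)
Long Context
MLVU: 85.9% (self-reported)
LVBench: 73.6% (self-reported)
AA-LCR: 66.1% (self-reported)
MMLongBench-Doc: 0.60 / 100 (self-reported)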
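Math
HMMT 2025: 92.0% (self-reported)
HMMT25: 89.8% (self-reported)
MathVista-Mini: 87.8% (self-reported)
DynaMath: 87.7% (self-reported)
MathVision: 86.0% (self-reported)
CodeForces: 0.81 / 3000 (self-reported)
PolyMATH: 71.2% (self-reported)
Humanity's Last Exam: 48.5% (self-reported)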
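Multimodal
VLMsAreBlind: 96.9% (self-reported)
V*: 93.7% (self-reported)
AI2D: 92.9% (self-reported)
MMBench-V1.1: 92.6% (self-reported)
OmniDocBench 1.5: 88.9% (self-reported)
VideoMME w sub.: 87.0% (self-reported)
VideoMME w/o sub.: 82.8% (self-reported)
CC-OCR: 81.0% (self-reported)
CharXiv-R: 79.5% (self-reported)
MVBench: 74.6% (self-reported)
MMVU: 73.3% (self-reported)
BabyVision: 44.6% (self-reported)
ZEROBench-Sub: 0.36 / 100 (self-reported)
Nuscene: 15.2% (self-reported)
ZEROBench: 0.10 / 100 (self-reported)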
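Reasoning
CountBench: 0.98 / 100 (self-reported)
Hallusion Bench: 70.0% (self-reported)
BrowseComp-zh: 62.1% (self-reported)
ERQA: 60.5% (self-reported)
Seal-0: 47.2% (self-reported)
OJBench: 40.1% (self-reported)
Spatial Reasoning
RealWorldQA: 83.7% (self-reported)
Vision
ODinW: 41.1% (self-reported)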
AA evaluation indices
Intelligence Index: 42.1
Coding Index: 34.9
Tau2: 0.9
GPQA: 0.9
IFBench: 0.8
LCR: 0.7
SciCode: 0.4
Terminal-Bench Hard: 0.3
HLE: 0.2
LLM Stats category scores
Biology: 90
Instruction Following: 90
Structured Output: 80
Text-to-image: 80
Video: 80
Chemistry: 80
Embodied: 80
Finance: 80
General: 80
Grounding: 80
Image To Text: 80
Language: 80
Legal: 80
Math: 80
Physics: 80
Spatial Reasoning: 70
Vision: 70
Economics: 70
Frontend Development: 70
Healthcare: 70
Long Context: 70
Multimodal: 70
Reasoning: 70
Tool Calling: 60
Agents: 60
Code: 60
Communication: 60
Search: 60
Spatial: 20
3D: 20
Pricing
Input price: $0.3 / 1M tokens
Output price: $2.4 / 1M tokens
Blended price (3:1): $0.825 / 1M tokens
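The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. A minimal sketch of that arithmetic follows; the reading of "3:1" as three input tokens per output token is an assumption, and `blended_price` is an illustrative name, not part of any API.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices in USD per 1M tokens; the 3:1 weighting is assumed.
price = blended_price(0.3, 2.4)
print(f"${price:.3f} / 1M tokens")
```

Changing `input_parts` and `output_parts` gives a rough price for workloads with a different mix, for example output-heavy reasoning traffic.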
Speed
Output speed: 87.6 tokens/s
Time to first token: 1.40 s
Time to first answer: 24.23 s
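For a reasoning model, the gap between the first token and the first answer is presumably spent generating reasoning tokens. The rough sketch below assumes generation runs at the listed rate throughout, which the listing does not state. It estimates how many tokens that gap corresponds to and how long a complete response might take. The function names and the 500-token answer length are illustrative.

```python
TOKENS_PER_SECOND = 87.6      # listed output speed
TIME_TO_FIRST_TOKEN = 1.40    # seconds, as listed
TIME_TO_FIRST_ANSWER = 24.23  # seconds, as listed


def implied_reasoning_tokens(ttft: float, ttfa: float, tps: float) -> float:
    """Tokens generated between first token and first answer at a constant rate."""
    return (ttfa - ttft) * tps


def estimated_response_time(ttfa: float, answer_tokens: int, tps: float) -> float:
    """Seconds until an answer of `answer_tokens` tokens has fully streamed."""
    return ttfa + answer_tokens / tps


reasoning = implied_reasoning_tokens(
    TIME_TO_FIRST_TOKEN, TIME_TO_FIRST_ANSWER, TOKENS_PER_SECOND)
total = estimated_response_time(TIME_TO_FIRST_ANSWER, 500, TOKENS_PER_SECOND)
print(f"implied reasoning tokens: {reasoning:.0f}")
print(f"estimated time for a 500-token answer: {total:.2f} s")
```

Real latency also depends on prompt length, provider load, and how much the model reasons on a given query, so these are order-of-magnitude estimates only.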
Available providers
(LS internal units)
| Provider | Input price | Output price |
|---|---|---|
| Novita | 300K | 2.4M |
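
The provider figures line up with the dollar prices listed earlier if one internal unit equals one millionth of a dollar per 1M tokens (300K units against $0.3, 2.4M units against $2.4). That mapping is inferred from the numbers and is not documented on this page. A small conversion sketch under that assumption, with the hypothetical helper `units_to_usd`:

```python
SUFFIXES = {"K": 1_000, "M": 1_000_000}
UNITS_PER_DOLLAR = 1_000_000  # assumption: inferred from 300K units matching $0.3


def units_to_usd(value: str) -> float:
    """Convert a string such as '300K' or '2.4M' to USD per 1M tokens."""
    text = value.strip().upper()
    if text and text[-1] in SUFFIXES:
        units = float(text[:-1]) * SUFFIXES[text[-1]]
    else:
        units = float(text)
    return units / UNITS_PER_DOLLAR


for label, raw in [("input", "300K"), ("output", "2.4M")]:
    print(f"Novita {label}: ${units_to_usd(raw):.2f} / 1M tokens")
```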