Qwen3.5 27B (Reasoning)
AlibabaQwenОткрытые весаApache 2.0 · Коммерческое использование
Описание
Qwen3.5-27B is a multimodal dense foundation model with 27 billion parameters. It combines strong reasoning, coding, multilingual, long-context, and visual understanding performance in a production-friendly open-weight package with a native 262K context window.
Дата выхода
2026-02-24
Параметры
27.0B
Длина контекста
262K
Модальности
audio, image, text, video
Радар способностей
31
general
40
coding
86
reasoning
57
scienceоцен.
60
agents
80
multimodal
Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.
Рейтинги
| Домен | #Место | Оценка | Источник |
|---|---|---|---|
| Агентные возможности | 49 | 57.0 | LS |
| Рейтинг кодинга | 82 | 71.0 | AA |
| Общий рейтинг | 59 | 74.0 | AA |
| Мультимодальный рейтинг | 69 | 70.0 | LS |
| Рассуждения | 58 | 67.0 | LS |
| Наука | 76 | 65.0 | AA |
Оценки бенчмарков (LLM Stats)
3d
SUNRGBD
0.35 / 100Сам.
Hypersim
0.13 / 100Сам.
Agents
t2-bench
79.0%Сам.
BFCL-V4
68.5%Сам.
AndroidWorld_SR
64.2%Сам.
WideSearch
61.1%Сам.
BrowseComp
61.0%Сам.
FullStackBench en
60.1%Сам.
TIR-Bench
59.8%Сам.
FullStackBench zh
57.4%Сам.
OSWorld-Verified
56.2%Сам.
VITA-Bench
41.9%Сам.
Terminal-Bench 2.0
41.6%Сам.
DeepPlanning
22.6%Сам.
Biology
GPQA
85.5%Сам.
Chemistry
SuperGPQA
65.6%Сам.
Code
SWE-Bench Verified
72.4%Сам.
Communication
Multi-Challenge
60.8%Сам.
Embodied
EmbSpatialBench
0.84 / 100Сам.
Finance
MMLU-Pro
86.1%Сам.
MMLU-ProX
82.2%Сам.
General
IFEval
95.0%Сам.
MMLU-Redux
93.2%Сам.
C-Eval
90.5%Сам.
MAXIFE
88.0%Сам.
Global PIQA
87.5%Сам.
MMMLU
85.9%Сам.
MMMU
82.3%Сам.
Include
81.6%Сам.
MMStar
81.0%Сам.
LiveCodeBench v6
80.7%Сам.
IFBench
76.5%Сам.
MMMU-Pro
75.0%Сам.
LongBench v2
60.6%Сам.
NOVA-63
58.1%Сам.
SimpleVQA
0.56 / 100Сам.
Grounding
RefCOCO-avg
0.91 / 100Сам.
ScreenSpot Pro
70.3%Сам.
RefSpatialBench
0.68 / 100Сам.
Healthcare
VideoMMMU
82.3%Сам.
SlakeVQA
80.0%Сам.
MedXpertQA
62.4%Сам.
PMC-VQA
62.4%Сам.
Image To Text
OCRBench
89.4%Сам.
Language
LingoQA
82.0%Сам.
WMT24++
77.6%Сам.
Long Context
MLVU
85.9%Сам.
LVBench
73.6%Сам.
AA-LCR
66.1%Сам.
MMLongBench-Doc
0.60 / 100Сам.
Math
HMMT 2025
92.0%Сам.
HMMT25
89.8%Сам.
MathVista-Mini
87.8%Сам.
DynaMath
87.7%Сам.
MathVision
86.0%Сам.
CodeForces
0.81 / 3000Сам.
PolyMATH
71.2%Сам.
Humanity's Last Exam
48.5%Сам.
Multimodal
VLMsAreBlind
96.9%Сам.
V*
93.7%Сам.
AI2D
92.9%Сам.
MMBench-V1.1
92.6%Сам.
OmniDocBench 1.5
88.9%Сам.
VideoMME w sub.
87.0%Сам.
VideoMME w/o sub.
82.8%Сам.
CC-OCR
81.0%Сам.
CharXiv-R
79.5%Сам.
MVBench
74.6%Сам.
MMVU
73.3%Сам.
BabyVision
44.6%Сам.
ZEROBench-Sub
0.36 / 100Сам.
Nuscene
15.2%Сам.
ZEROBench
0.10 / 100Сам.
Reasoning
CountBench
0.98 / 100Сам.
Hallusion Bench
70.0%Сам.
BrowseComp-zh
62.1%Сам.
ERQA
60.5%Сам.
Seal-0
47.2%Сам.
OJBench
40.1%Сам.
Spatial Reasoning
RealWorldQA
83.7%Сам.
Vision
ODinW
41.1%Сам.
Индексы оценки AA
Intelligence Index33.8
Tau20.9
Gpqa0.9
Ifbench0.8
Lcr0.7
Scicode0.4
Terminalbench Hard0.3
Hle0.2
Оценки категорий LLM Stats
Instruction Following90
Biology90
Image To Text80
Language80
Legal80
Math80
Physics80
Structured Output80
Embodied80
Finance80
General80
Grounding80
Chemistry80
Text-to-image80
Video80
Long Context70
Multimodal70
Reasoning70
Spatial Reasoning70
Frontend Development70
Healthcare70
Economics70
Vision70
Search60
Agents60
Code60
Communication60
Tool Calling60
Spatial20
3d20
Цены
Цена ввода$0.3 / 1M токенов
Цена вывода$2.4 / 1M токенов
Смешанная цена (3:1)$0.825 / 1M токенов
Скорость
Токенов/сек86.8
Задержка первого токена1.47s
Время до первого ответа24.52s
Рейтинг цен провайдеров
Рейтинг цен провайдеров
10 провайдеров
Самый дешевый: NovitaСамый дорогой: NanoGPT
ПровайдерВводВывод
1NovitaСамый дешевый
$0
$0
2OrcaRouter
$0.086
$0.688
3OpenRouter
$0.195
$1.56
4Kilo Gateway
$0.195
$1.56
5SiliconFlow (China)
$0.26
$2.09
6AlibabaОсновной
$0.3
$2.4
7Hugging Face
$0.3
$2.4
8NovitaAI
$0.3
$2.4
9Mixlayer
$0.3
$2.4
10NanoGPT
$0.306
$0.306
Сравнение цен разных API-провайдеров для этой модели.