Qwen3.5 27B (Reasoning)
Alibaba · Qwen · Open Weight · Apache 2.0 · Commercial OK
Description
Qwen3.5-27B is a multimodal dense foundation model with 27 billion parameters. It combines strong reasoning, coding, multilingual, long-context, and visual understanding performance in a production-friendly open-weight package with a native 262K context window.
Release date
2026-02-24
Parameters
27.0B
Context length
262K
Modalities
image, text, video
Capability radar
general: 38
coding: 36
reasoning: 86
science (estimated): 57
agents: 60
multimodal: 80
When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 51 | 57.0 | LS |
| Code Ranking | 84 | 65.0 | AA |
| General Ranking | 43 | 80.0 | AA |
| Multimodal Ranking | 60 | 70.0 | LS |
| Reasoning | 54 | 67.0 | LS |
| Science | 63 | 68.0 | AA |
Benchmark scores (LLM Stats)
3d
SUNRGBD
0.35 / 100 (self-reported)
Hypersim
0.13 / 100 (self-reported)
Agents
t2-bench
79.0% (self-reported)
BFCL-V4
68.5% (self-reported)
AndroidWorld_SR
64.2% (self-reported)
WideSearch
61.1% (self-reported)
BrowseComp
61.0% (self-reported)
FullStackBench en
60.1% (self-reported)
TIR-Bench
59.8% (self-reported)
FullStackBench zh
57.4% (self-reported)
OSWorld-Verified
56.2% (self-reported)
VITA-Bench
41.9% (self-reported)
Terminal-Bench 2.0
41.6% (self-reported)
DeepPlanning
22.6% (self-reported)
Biology
GPQA
85.5% (self-reported)
Chemistry
SuperGPQA
65.6% (self-reported)
Code
SWE-Bench Verified
72.4% (self-reported)
Communication
Multi-Challenge
60.8% (self-reported)
Embodied
EmbSpatialBench
0.84 / 100 (self-reported)
Finance
MMLU-Pro
86.1% (self-reported)
MMLU-ProX
82.2% (self-reported)
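General
IFEval
95.0% (self-reported)
MMLU-Redux
93.2% (self-reported)
C-Eval
90.5% (self-reported)
MAXIFE
88.0% (self-reported)
Global PIQA
87.5% (self-reported)
MMMLU
85.9% (self-reported)
MMMU
82.3% (self-reported)
Include
81.6% (self-reported)
MMStar
81.0% (self-reported)
LiveCodeBench v6
80.7% (self-reported)
IFBench
76.5% (self-reported)
MMMU-Pro
75.0% (self-reported)
LongBench v2
60.6% (self-reported)
NOVA-63
58.1% (self-reported)
SimpleVQA
0.56 / 100 (self-reported)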
Grounding
RefCOCO-avg
0.91 / 100 (self-reported)
ScreenSpot Pro
70.3% (self-reported)
RefSpatialBench
0.68 / 100 (self-reported)
Healthcare
VideoMMMU
82.3% (self-reported)
SlakeVQA
80.0% (self-reported)
MedXpertQA
62.4% (self-reported)
PMC-VQA
62.4% (self-reported)
Image To Text
OCRBench
89.4% (self-reported)
Language
LingoQA
82.0% (self-reported)
WMT24++
77.6% (self-reported)
Long Context
MLVU
85.9% (self-reported)
LVBench
73.6% (self-reported)
AA-LCR
66.1% (self-reported)
MMLongBench-Doc
0.60 / 100 (self-reported)
Math
HMMT 2025
92.0% (self-reported)
HMMT25
89.8% (self-reported)
MathVista-Mini
87.8% (self-reported)
DynaMath
87.7% (self-reported)
MathVision
86.0% (self-reported)
CodeForces
0.81 / 3000 (self-reported)
PolyMATH
71.2% (self-reported)
Humanity's Last Exam
48.5% (self-reported)
Multimodal
VLMsAreBlind
96.9% (self-reported)
V*
93.7% (self-reported)
AI2D
92.9% (self-reported)
MMBench-V1.1
92.6% (self-reported)
OmniDocBench 1.5
88.9% (self-reported)
VideoMME w sub.
87.0% (self-reported)
VideoMME w/o sub.
82.8% (self-reported)
CC-OCR
81.0% (self-reported)
CharXiv-R
79.5% (self-reported)
MVBench
74.6% (self-reported)
MMVU
73.3% (self-reported)
BabyVision
44.6% (self-reported)
ZEROBench-Sub
0.36 / 100 (self-reported)
Nuscene
15.2% (self-reported)
ZEROBench
0.10 / 100 (self-reported)
Reasoning
CountBench
0.98 / 100 (self-reported)
Hallusion Bench
70.0% (self-reported)
BrowseComp-zh
62.1% (self-reported)
ERQA
60.5% (self-reported)
Seal-0
47.2% (self-reported)
OJBench
40.1% (self-reported)
Spatial Reasoning
RealWorldQA
83.7% (self-reported)
Vision
ODinW
41.1% (self-reported)
AA evaluation indices
Intelligence Index: 42.1
Coding Index: 34.9
Tau2: 0.9
Gpqa: 0.9
Ifbench: 0.8
Lcr: 0.7
Scicode: 0.4
Terminalbench Hard: 0.3
Hle: 0.2
LLM Stats category scores
Biology: 90
Instruction Following: 90
Structured Output: 80
Text-to-image: 80
Video: 80
Chemistry: 80
Embodied: 80
Finance: 80
General: 80
Grounding: 80
Image To Text: 80
Language: 80
Legal: 80
Math: 80
Physics: 80
Spatial Reasoning: 70
Vision: 70
Economics: 70
Frontend Development: 70
Healthcare: 70
Long Context: 70
Multimodal: 70
Reasoning: 70
Tool Calling: 60
Agents: 60
Code: 60
Communication: 60
Search: 60
Spatial: 20
3d: 20
Pricing
Input price: $0.3 / 1M tokens
Output price: $2.4 / 1M tokens
Blended price (3:1): $0.825 / 1M tokens
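The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. A minimal Python sketch of that arithmetic follows; the function name `blended_price` and its parameters are illustrative, not part of any provider API, and the weighting scheme is inferred from the "(3:1)" label and the listed numbers.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output ratio.

    Assumption: "blended (3:1)" means 3 parts input tokens to 1 part
    output tokens, which reproduces the listed figure.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Listed prices in USD per 1M tokens: input 0.3, output 2.4.
price = blended_price(0.3, 2.4)
print(f"${price:.3f} / 1M tokens")  # (3 * 0.3 + 1 * 2.4) / 4 = 0.825
```

A workload with a different input-to-output mix can pass its own ratio, for example `blended_price(0.3, 2.4, input_parts=1, output_parts=1)` for an even split.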
Speed
Tokens/second: 87.6 tokens/s
Time to first token: 1.40s
Time to first answer: 24.23s
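For rough capacity planning, the speed figures can be combined into a simple latency estimate. The sketch below assumes a constant generation rate after the first token, which is a simplification, and the function name `estimated_response_seconds` is hypothetical. Under this model, roughly 2,000 tokens generated before the answer would account for the listed time to first answer; that is an inference from the numbers, not something the listing states.

```python
def estimated_response_seconds(output_tokens: int,
                               first_token_latency_s: float = 1.40,
                               tokens_per_second: float = 87.6) -> float:
    """Estimate wall-clock seconds to generate `output_tokens` tokens.

    Assumption: throughput is constant once the first token arrives.
    Defaults are the figures listed above.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_latency_s + output_tokens / tokens_per_second


# About 2,000 tokens generated before the visible answer starts.
print(f"{estimated_response_seconds(2000):.2f} s")
```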
Available providers
(LS internal units)
| Provider | Input price | Output price |
|---|---|---|
| Novita | 300K | 2.4M |