Qwen3 VL 32B (Reasoning)

AlibabaQwenОткрытые весаApache 2.0 · Коммерческое использование

Описание

Qwen3-VL is a large multimodal model that unifies vision, language, and reasoning to achieve human-level perception and cognition across text, images, and video. Built on a 235B-parameter architecture, it integrates early joint training of visual and textual modalities for strong language grounding. The model supports up to a 1 million-token context window and excels at visual understanding, spatial reasoning, long video comprehension, and tool-based interaction. It can generate code from images, perform precise 2D/3D object grounding, and operate digital interfaces like a visual agent. The “Instruct” version rivals Gemini 2.5 Pro in perception benchmarks, while the “Thinking” version leads in multimodal reasoning and STEM tasks. With multilingual OCR, creative writing, and fine-grained scene interpretation, Qwen3-VL establishes a new open-source frontier for integrated vision-language intelligence.

Дата выхода

2025-10-21

Параметры

33.0B

Длина контекста

131K

Модальности

image, text

Радар способностей

general

coding

reasoning

scienceоцен.

agents

multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен	#Место	Оценка	Источник
Агентные возможности	29	61.0	LS
Рейтинг кодинга	159	55.0	AA
Общий рейтинг	187	52.0	AA
Математическое мышление	61	86.0	AA
Мультимодальный рейтинг	33	80.0	LS
Рассуждения	71	58.0	LS
Наука	233	46.0	AA

Оценки бенчмарков (LLM Stats)

3d

BLINK

68.5%Сам.

Agents

BFCL-v3

71.7%Сам.

AndroidWorld_SR

63.7%Сам.

OSWorld

41.0%Сам.

Biology

GPQA

73.1%Сам.

Chemistry

SuperGPQA

59.0%Сам.

Communication

MM-MT-Bench

8.30 / 100Сам.

WritingBench

86.2%Сам.

Multi-IF

78.0%Сам.

Creativity

Creative Writing v3

83.3%Сам.

Arena-Hard v2

60.5%Сам.

Factuality

SimpleQA

55.4%Сам.

Finance

MMLU

88.7%Сам.

MMLU-Pro

82.1%Сам.

MMLU-ProX

77.2%Сам.

General

MMLU-Redux

91.9%Сам.

IFEval

87.8%Сам.

MMStar

79.4%Сам.

MMMU (val)

78.1%Сам.

Include

76.3%Сам.

LiveBench 20241125

74.7%Сам.

MMMU-Pro

68.1%Сам.

LiveCodeBench v6

65.6%Сам.

Grounding

ScreenSpot

95.7%Сам.

ScreenSpot Pro

57.1%Сам.

Healthcare

VideoMMMU

79.0%Сам.

Image To Text

OCRBench

85.5%Сам.

OCRBench-V2 (en)

68.4%Сам.

OCRBench-V2 (zh)

62.1%Сам.

Language

CharadesSTA

62.8%Сам.

Long Context

LVBench

62.6%Сам.

Math

MathVista-Mini

85.9%Сам.

AIME 2025

83.7%Сам.

MathVision

70.2%Сам.

PolyMATH

52.0%Сам.

Multimodal

DocVQAtest

96.1%Сам.

MMBench-V1.1

90.8%Сам.

CharXiv-D

90.2%Сам.

InfoVQAtest

89.2%Сам.

AI2D

88.9%Сам.

MuirBench

80.3%Сам.

VideoMME w/o sub.

77.3%Сам.

MVBench

73.2%Сам.

CharXiv-R

65.2%Сам.

Reasoning

Hallusion Bench

67.4%Сам.

ERQA

52.3%Сам.

Spatial Reasoning

RealWorldQA

78.4%Сам.

Индексы оценки AA

Math Index

84.7

Intelligence Index

17.9

Aime 25

0.8

Mmlu Pro

0.8

Livecodebench

0.7

Gpqa

0.7

Ifbench

0.6

Lcr

0.6

Tau2

0.5

Scicode

0.3

Hle

0.1

Terminalbench Hard

0.1

Оценки категорий LLM Stats

Communication

Multimodal

Instruction Following

Language

Legal

Math

Structured Output

Finance

Grounding

Healthcare

Creativity

Writing

Image To Text

Physics

Reasoning

Spatial Reasoning

General

Biology

Chemistry

Tool Calling

Video

Vision

Long Context

Factuality

Agents

Economics

Цены

Цена ввода$0.7 / 1M токенов

Цена вывода$8.4 / 1M токенов

Смешанная цена (3:1)$2.625 / 1M токенов

Скорость

Токенов/сек94.2

Задержка первого токена1.43s

Время до первого ответа22.66s

Рейтинг цен провайдеров

5 провайдеров

Самый дешевый: OpenRouterСамый дорогой: Alibaba

ПровайдерВводВывод

1OpenRouterСамый дешевый

$0.104

$0.416

2Kilo Gateway

$0.104

$0.416

3SiliconFlow (China)

$0.2

$1.5

4SiliconFlow

$0.2

$1.5

5AlibabaОсновной

$0.7

$8.4

Сравнение цен разных API-провайдеров для этой модели.

Внешние ссылки

LLM Stats Artificial Analysis