Qwen3 VL 30B A3B (Reasoning)

AlibabaQwenОткрытые весаApache 2.0 · Коммерческое использование

Описание

Qwen3-VL is a large multimodal model that unifies vision, language, and reasoning to achieve human-level perception and cognition across text, images, and video. Built on a 235B-parameter architecture, it integrates early joint training of visual and textual modalities for strong language grounding. The model supports up to a 1 million-token context window and excels at visual understanding, spatial reasoning, long video comprehension, and tool-based interaction. It can generate code from images, perform precise 2D/3D object grounding, and operate digital interfaces like a visual agent. The “Instruct” version rivals Gemini 2.5 Pro in perception benchmarks, while the “Thinking” version leads in multimodal reasoning and STEM tasks. With multilingual OCR, creative writing, and fine-grained scene interpretation, Qwen3-VL establishes a new open-source frontier for integrated vision-language intelligence.

Дата выхода

2025-10-03

Параметры

31.0B

Длина контекста

131K

Модальности

image, text, video

Радар способностей

general

coding

reasoning

scienceоцен.

agents

100

multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен	#Место	Оценка	Источник
Агентные возможности	69	54.0	LS
Рейтинг кодинга	211	46.0	AA
Общий рейтинг	276	39.0	AA
Математическое мышление	73	83.0	AA
Мультимодальный рейтинг	42	77.0	LS
Рассуждения	85	53.0	LS
Наука	247	45.0	AA

Оценки бенчмарков (LLM Stats)

3d

BLINK

65.4%Сам.

Agents

BFCL-v3

68.6%Сам.

OSWorld

30.6%Сам.

Biology

GPQA

74.4%Сам.

Chemistry

SuperGPQA

56.4%Сам.

Communication

MM-MT-Bench

7.90 / 100Сам.

WritingBench

85.2%Сам.

Multi-IF

73.0%Сам.

Creativity

Creative Writing v3

82.5%Сам.

Arena-Hard v2

56.7%Сам.

Factuality

SimpleQA

23.9%Сам.

Finance

MMLU

87.6%Сам.

MMLU-Pro

80.5%Сам.

MMLU-ProX

76.1%Сам.

General

MMLU-Redux

90.9%Сам.

IFEval

81.7%Сам.

MLVU-M

78.9%Сам.

MMMU (val)

76.0%Сам.

MMStar

75.5%Сам.

Include

74.5%Сам.

LiveBench 20241125

72.1%Сам.

LiveCodeBench v6

64.2%Сам.

MMMU-Pro

63.0%Сам.

Grounding

ScreenSpot

94.7%Сам.

ScreenSpot Pro

57.3%Сам.

Healthcare

VideoMMMU

75.0%Сам.

Image To Text

OCRBench

83.9%Сам.

OCRBench-V2 (en)

62.6%Сам.

OCRBench-V2 (zh)

60.4%Сам.

Language

CharadesSTA

62.7%Сам.

Long Context

LVBench

59.2%Сам.

Math

AIME 2025

83.1%Сам.

MathVista-Mini

81.9%Сам.

HMMT25

67.6%Сам.

MathVision

65.7%Сам.

PolyMATH

51.7%Сам.

Multimodal

DocVQAtest

95.0%Сам.

MMBench-V1.1

88.9%Сам.

CharXiv-D

86.9%Сам.

AI2D

86.9%Сам.

InfoVQAtest

86.0%Сам.

CC-OCR

77.8%Сам.

MuirBench

77.6%Сам.

Video-MME

73.3%Сам.

MVBench

72.0%Сам.

CharXiv-R

56.6%Сам.

Reasoning

Hallusion Bench

66.0%Сам.

ERQA

45.3%Сам.

Spatial Reasoning

RealWorldQA

77.4%Сам.

Vision

ODinW

42.3%Сам.

Индексы оценки AA

Math Index

82.3

Intelligence Index

13.3

Aime 25

0.8

Mmlu Pro

0.8

Gpqa

0.7

Livecodebench

0.7

Ifbench

0.5

Lcr

0.4

Scicode

0.3

Tau2

0.2

Hle

0.1

Terminalbench Hard

0.1

Оценки категорий LLM Stats

Communication

Multimodal

100

Instruction Following

Language

Legal

Structured Output

Finance

Grounding

Healthcare

Text-to-image

Image To Text

Math

Physics

Reasoning

Spatial Reasoning

General

Biology

Chemistry

Creativity

Tool Calling

Video

Vision

Writing

Long Context

Economics

Agents

Factuality

Цены

Цена ввода$0.2 / 1M токенов

Цена вывода$0.75 / 1M токенов

Смешанная цена (3:1)$0.338 / 1M токенов

Скорость

Токенов/сек122.7

Задержка первого токена1.01s

Время до первого ответа17.31s

Рейтинг цен провайдеров

2 провайдеров

Самый дешевый: Alibaba (China)Самый дорогой: Alibaba

ПровайдерВводВывод

1Alibaba (China)Самый дешевый

$0.108

$0.431

2AlibabaОсновной

$0.2

$0.75

Сравнение цен разных API-провайдеров для этой модели.

Внешние ссылки

LLM Stats Artificial Analysis