Qwen3 VL 235B A22B Instruct

Alibaba · Qwen · Open weights · Apache 2.0 · Commercial use

Description

Qwen3-VL is a large multimodal model that unifies vision, language, and reasoning to achieve human-level perception and cognition across text, images, and video. Built on a 235B-parameter architecture, it integrates early joint training of visual and textual modalities for strong language grounding. The model supports a native 256K-token context window, expandable to 1 million tokens, and excels at visual understanding, spatial reasoning, long video comprehension, and tool-based interaction. It can generate code from images, perform precise 2D/3D object grounding, and operate digital interfaces like a visual agent. The “Instruct” version rivals Gemini 2.5 Pro in perception benchmarks, while the “Thinking” version leads in multimodal reasoning and STEM tasks. With multilingual OCR, creative writing, and fine-grained scene interpretation, Qwen3-VL establishes a new open-source frontier for integrated vision-language intelligence.

Release date

2025-09-23

Parameters

236.0B

Context length

262K

Modalities

image, text, video

Capability radar

[Radar chart. Axes: general, coding, reasoning, science (estimated), agents, multimodal]

The science axis uses a reasoning-based proxy when specialized science benchmarks are unavailable.

Rankings

Domain	Rank	Score	Source
Agentic capabilities	16	67.0	LS
Coding	251	39.0	AA
Overall	244	43.0	AA
Mathematical reasoning	118	71.0	AA
Multimodal	36	79.0	LS
Reasoning	77	56.0	LS
Science	209	48.0	AA

Benchmark scores (LLM Stats)

All scores in this table are self-reported.

Category	Benchmark	Score
3d	BLINK	70.7%
Agents	BFCL-v3	67.7%
Agents	OSWorld	66.7%
Agents	AndroidWorld_SR	63.7%
Chemistry	SuperGPQA	60.4%
Communication	MM-MT-Bench	8.50 / 100
Communication	WritingBench	85.5%
Communication	Multi-IF	76.3%
Creativity	Creative Writing v3	86.5%
Creativity	Arena-Hard v2	77.4%
Factuality	SimpleQA	51.9%
Finance	MMLU	88.8%
Finance	MMLU-Pro	81.8%
Finance	MMLU-ProX	77.8%
General	MMLU-Redux	92.2%
General	IFEval	87.8%
General	MultiPL-E	86.1%
General	CSimpleQA	83.4%
General	Include	80.0%
General	MMMU (val)	78.7%
General	MMStar	78.4%
General	LiveBench 20241125	74.8%
General	MMMU-Pro	68.1%
General	LiveCodeBench v5	61.4%
General	LiveCodeBench v6	54.3%
Grounding	ScreenSpot	95.4%
Grounding	ScreenSpot Pro	62.0%
Healthcare	VideoMMMU	74.7%
Image To Text	OCRBench	92.0%
Image To Text	OCRBench-V2 (en)	67.1%
Image To Text	OCRBench-V2 (zh)	61.8%
Language	CharadesSTA	64.8%
Long Context	MLVU	84.3%
Long Context	LVBench	67.7%
Math	MathVista-Mini	84.9%
Math	AIME 2025	74.7%
Math	MathVision	66.5%
Math	HMMT25	57.4%
Multimodal	DocVQA (test)	97.1%
Multimodal	MMBench-V1.1	89.9%
Multimodal	AI2D	89.7%
Multimodal	InfoVQA (test)	89.2%
Multimodal	CC-OCR	82.2%
Multimodal	VideoMME w/o sub.	79.2%
Multimodal	MuirBench	72.8%
Multimodal	CharXiv-R	62.1%
Reasoning	Hallusion Bench	63.2%
Reasoning	ERQA	51.3%
Spatial Reasoning	RealWorldQA	79.3%
Vision	ODinW	48.6%

AA evaluation indices

Index	Score
Math Index	70.7
Intelligence Index	14.3
MMLU-Pro	0.8
GPQA	0.7
AIME 25	0.7
LiveCodeBench	0.6
IFBench	0.4
SciCode	0.4
Tau2	0.4
LCR	0.3
Terminal-Bench Hard	0.1
HLE	0.1

LLM Stats category scores

[Chart of per-category scores; the values did not survive extraction. Categories: Communication, Multimodal, Instruction Following, Language, Legal, Long Context, Math, Structured Output, Finance, Grounding, Healthcare, Creativity, Text-to-image, Video, Writing, Image To Text, Reasoning, Spatial Reasoning, General, Agents, Tool Calling, Vision, Physics, Chemistry, Economics, Factuality]

Pricing

Input price: $0.3 / 1M tokens

Output price: $1.9 / 1M tokens

Blended price (3:1): $0.7 / 1M tokens

Cache read price: $0.11 / 1M tokens
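
The blended figure is consistent with a weighted average of the input and output prices. A minimal sketch, assuming "3:1" means three input tokens for every output token; the function name `blended_price` is hypothetical and not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token ratio."""
    total_parts = input_parts + output_parts
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices in USD per 1M tokens: input 0.3, output 1.9.
print(round(blended_price(0.3, 1.9), 2))  # 0.7
```

Under this assumption, (3 × 0.3 + 1 × 1.9) / 4 = 0.7, which matches the listed blended price.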

Speed

Tokens/sec: 52.9

First-token latency: 1.15s

Time to first answer: 1.15s
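
These two figures can be combined into a rough end-to-end estimate for a response of a given length. A minimal sketch, assuming generation starts after the first-token latency and then proceeds at a constant throughput; real latency varies by provider, load, and prompt size, and the function name `estimate_response_seconds` is hypothetical.

```python
def estimate_response_seconds(output_tokens: int,
                              tokens_per_sec: float = 52.9,
                              first_token_latency: float = 1.15) -> float:
    """Rough wall-clock estimate: first-token latency plus steady-state generation time."""
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return first_token_latency + output_tokens / tokens_per_sec


# Estimate for a 500-token answer at the listed speed figures.
print(f"{estimate_response_seconds(500):.1f} s")
```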

Provider price ranking

14 providers

Cheapest: DeepInfra · Most expensive: STACKIT

#	Provider	Input ($/1M tokens)	Output ($/1M tokens)
1	DeepInfra (cheapest)	not shown	not shown
2	Novita	not shown	not shown
3	OpenRouter	$0.2	$0.88
4	Kilo Gateway	$0.2	$0.88
5	Alibaba (primary)	$0.3	$1.9
6	SiliconFlow (China)	$0.3	$1.5
7	NovitaAI	$0.3	$1.5
8	Helicone	$0.3	$1.5
9	Amazon Bedrock	$0.3	$1.5
10	SiliconFlow	$0.3	$1.5
11	LLM Gateway	$0.3	$1.5
12	Vercel AI Gateway	$0.4	$1.6
13	NanoGPT	$0.5	$1.2
14	STACKIT	$1.64	$1.91

Comparison of prices across API providers for this model.
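
Which provider is cheapest in practice depends on how many input and output tokens a workload uses. A minimal sketch of a per-request cost comparison, assuming the listed prices are in USD per 1M tokens; it includes only providers whose prices appear in the listing, and the names `request_cost` and `rank_by_cost` are hypothetical.

```python
# (provider, input USD per 1M tokens, output USD per 1M tokens), from the listing.
PROVIDERS = [
    ("OpenRouter", 0.2, 0.88),
    ("Kilo Gateway", 0.2, 0.88),
    ("Alibaba", 0.3, 1.9),
    ("SiliconFlow (China)", 0.3, 1.5),
    ("NovitaAI", 0.3, 1.5),
    ("Helicone", 0.3, 1.5),
    ("Amazon Bedrock", 0.3, 1.5),
    ("SiliconFlow", 0.3, 1.5),
    ("LLM Gateway", 0.3, 1.5),
    ("Vercel AI Gateway", 0.4, 1.6),
    ("NanoGPT", 0.5, 1.2),
    ("STACKIT", 1.64, 1.91),
]


def request_cost(input_price: float, output_price: float,
                 input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(providers, input_tokens: int, output_tokens: int):
    """Sort providers from cheapest to most expensive for the given token counts."""
    return sorted(
        providers,
        key=lambda p: request_cost(p[1], p[2], input_tokens, output_tokens),
    )


# Output-heavy workload: 1,000 input tokens and 10,000 output tokens.
for name, inp, out in rank_by_cost(PROVIDERS, 1_000, 10_000):
    print(f"{name}: ${request_cost(inp, out, 1_000, 10_000):.4f}")
```

For an output-heavy workload like this one, a provider with a higher input price but a lower output price (NanoGPT in the listing) can cost less per request than one listed above it.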

External links

LLM Stats · Artificial Analysis