Qwen3 VL 235B A22B (Reasoning)

AlibabaQwen오픈 웨이트Apache 2.0 · 상업적 사용 가능

설명

Qwen3-VL-235B-A22B-Thinking is the most powerful vision-language model in the Qwen series, featuring 236B parameters with MoE architecture for reasoning-enhanced multimodal understanding. Key capabilities include: Visual Agent (operates PC/mobile GUIs, recognizes elements, invokes tools), Visual Coding (generates Draw.io/HTML/CSS/JS from images/videos), Advanced Spatial Perception (2D grounding and 3D grounding for spatial reasoning and embodied AI), Long Context & Video Understanding (native 256K context expandable to 1M, handles hours-long video with second-level indexing), Enhanced Multimodal Reasoning (excels in STEM/Math with causal analysis), Upgraded Visual Recognition (celebrities, anime, products, landmarks, flora/fauna), and Expanded OCR (32 languages, robust in low light/blur/tilt). Architecture innovations include Interleaved-MRoPE for positional embeddings, DeepStack for multi-level ViT feature fusion, and Text-Timestamp Alignment for precise video temporal modeling.

출시일

2025-09-23

파라미터

236.0B

컨텍스트 길이

131K

모달리티

image, text, video

능력 레이더

general

coding

reasoning

science추정

agents

100

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	19	66.0	LS
코딩 랭킹	158	55.0	AA
종합 랭킹	165	55.0	AA
수학 추론	49	89.0	AA
멀티모달 랭킹	73	67.0	LS
추론	40	75.0	LS
과학	155	54.0	AA

벤치마크 점수 (LLM Stats)

3d

Objectron

0.71 / 100자체 보고

BLINK

67.1%자체 보고

ARKitScenes

0.54 / 100자체 보고

SUNRGBD

0.35 / 100자체 보고

Hypersim

0.11 / 100자체 보고

Agents

SIFO

0.77 / 100자체 보고

BFCL-v3

71.9%자체 보고

SIFO-Multiturn

0.71 / 100자체 보고

OSWorld-G

0.68 / 100자체 보고

OSWorld

38.1%자체 보고

Chemistry

SuperGPQA

64.3%자체 보고

Code

Design2Code

0.93 / 100자체 보고

Communication

MM-MT-Bench

8.50 / 100자체 보고

WritingBench

86.7%자체 보고

Multi-IF

79.1%자체 보고

Creativity

Creative Writing v3

85.7%자체 보고

Embodied

EmbSpatialBench

0.84 / 100자체 보고

RoboSpatialHome

0.74 / 100자체 보고

Factuality

SimpleQA

44.4%자체 보고

Finance

MMLU

90.6%자체 보고

MMLU-Pro

83.8%자체 보고

MMLU-ProX

80.6%자체 보고

General

MMLU-Redux

93.7%자체 보고

IFEval

88.2%자체 보고

MMMUval

80.6%자체 보고

Include

80.0%자체 보고

LiveBench 20241125

79.6%자체 보고

MMStar

78.7%자체 보고

LiveCodeBench v6

70.1%자체 보고

MMMU-Pro

69.3%자체 보고

SimpleVQA

0.61 / 100자체 보고

Grounding

ScreenSpot

95.4%자체 보고

RefCOCO-avg

0.92 / 100자체 보고

RefSpatialBench

0.70 / 100자체 보고

ScreenSpot Pro

61.8%자체 보고

Healthcare

VideoMMMU

80.0%자체 보고

Image To Text

OCRBench

87.5%자체 보고

OCRBench-V2 (en)

66.8%자체 보고

OCRBench-V2 (zh)

63.5%자체 보고

Instruction Following

MIABench

0.93 / 100자체 보고

Language

CharadesSTA

63.5%자체 보고

Long Context

MLVU

83.8%자체 보고

LVBench

63.6%자체 보고

MMLongBench-Doc

0.56 / 100자체 보고

Math

AIME 2025

89.7%자체 보고

MathVista-Mini

85.8%자체 보고

MathVerse-Mini

0.85 / 100자체 보고

HMMT25

77.4%자체 보고

MathVision

74.6%자체 보고

Humanity's Last Exam

13.6%자체 보고

Multimodal

DocVQAtest

96.5%자체 보고

MMBench-V1.1

90.6%자체 보고

InfoVQAtest

89.5%자체 보고

AI2D

89.2%자체 보고

CC-OCR

81.5%자체 보고

MuirBench

80.1%자체 보고

VideoMME w/o sub.

79.0%자체 보고

CharXiv-R

66.1%자체 보고

VisuLogic

0.34 / 100자체 보고

ZEROBench-Sub

0.28 / 100자체 보고

ZEROBench

0.04 / 100자체 보고

Reasoning

ZebraLogic

97.3%자체 보고

CountBench

0.94 / 100자체 보고

Hallusion Bench

66.7%자체 보고

ERQA

52.5%자체 보고

Spatial Reasoning

RealWorldQA

81.3%자체 보고

Vision

ODinW

43.2%자체 보고

AA 평가 지수

Math Index

88.3

Intelligence Index

20.6

Aime 25

0.9

Mmlu Pro

0.8

Gpqa

0.8

Livecodebench

0.6

Lcr

0.6

Ifbench

0.6

Tau2

0.5

Scicode

0.4

Terminalbench Hard

0.1

Hle

0.1

LLM Stats 카테고리 점수

Communication

Multimodal

100

Creativity

Writing

Instruction Following

Language

Legal

Math

Structured Output

Embodied

Finance

Grounding

Healthcare

Text-to-image

Video

Image To Text

Long Context

Reasoning

Spatial Reasoning

General

Tool Calling

Vision

Physics

Agents

Chemistry

Economics

Factuality

가격

입력 가격$0.84 / 1M 토큰

출력 가격$6.175 / 1M 토큰

혼합 가격 (3:1)$2.174 / 1M 토큰

속도

토큰/초57.2

첫 토큰 지연1.16s

첫 응답 지연36.11s

공급자 가격 순위

10개 공급자

최저가: Venice AI최고가: NovitaAI

공급자입력출력

1Venice AI최저가

$0.25

$1.5

2OpenRouter

$0.26

$2.6

3Kilo Gateway

$0.26

$2.6

4Alibaba (China)

$0.28671

$1.14682

5SiliconFlow (China)

$0.45

$3.5

6SiliconFlow

$0.45

$3.5

7NanoGPT

$0.5

8LLM Gateway

$0.5

9Alibaba

$0.7

$2.8

10NovitaAI

$0.98

$3.95

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

LLM Stats Artificial Analysis