Qwen2.5 VL 72B Instruct

Alibaba Cloud / Qwen TeamQwenオープンウエイトtongyi-qianwen

説明

Qwen2.5-VL is the new flagship vision-language model of Qwen, significantly improved from Qwen2-VL. It excels at recognizing objects, analyzing text/charts/layouts in images, acting as a visual agent, understanding long videos (over 1 hour) with event pinpointing, performing visual localization (bounding boxes/points), and generating structured outputs from documents.

リリース日

2025-01-26

パラメータ

72.0B

コンテキスト長

131K

モダリティ

image, text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

AITZ_EM

83.2%自己申告

MobileMiniWob++_SR

68.0%自己申告

AndroidWorld_SR

35.0%自己申告

OSWorld

8.8%自己申告

General

MMVet

76.2%自己申告

MLVU-M

74.6%自己申告

MMStar

70.8%自己申告

MMMU

70.2%自己申告

MMMU-Pro

51.1%自己申告

Grounding

ScreenSpot

87.1%自己申告

ScreenSpot Pro

43.6%自己申告

Image To Text

DocVQA

96.4%自己申告

OCRBench

88.5%自己申告

OCRBench-V2 (en)

61.5%自己申告

Long Context

EgoSchema

76.2%自己申告

LVBench

47.3%自己申告

Math

MathVista-Mini

74.8%自己申告

MathVision

38.1%自己申告

Multimodal

Android Control Low_EM

93.7%自己申告

ChartQA

89.5%自己申告

AI2D

88.4%自己申告

MMBench

88.0%自己申告

CC-OCR

79.8%自己申告

TempCompass

74.8%自己申告

VideoMME w/o sub.

73.3%自己申告

PerceptionTest

73.2%自己申告

MVBench

70.4%自己申告

Android Control High_EM

67.4%自己申告

MMBench-Video

2.0%自己申告

Reasoning

Hallusion Bench

55.2%自己申告

AA評価指数

AA評価データがありません

LLM Statsカテゴリスコア

Image To Text

Structured Output

Text-to-image

Reasoning

Spatial Reasoning

Grounding

Healthcare

Long Context

Math

Multimodal

Vision

General

Video

Agents

価格設定

入力価格$2.8 / 1Mトークン

出力価格$8.4 / 1Mトークン

混合価格（3:1）$4.2 / 1Mトークン

速度

速度データがありません

プロバイダー価格ランキング

12 プロバイダー

最安: Nebius Token Factory最高: LLM Gateway

プロバイダー入力出力

1Nebius Token Factory最安

$0.25

$0.75

2SiliconFlow (China)

$0.59

3SiliconFlow

$0.59

4NanoGPT

$0.69989

5OpenRouter

$0.8

6NovitaAI

$0.8

7Kilo Gateway

$0.8

8OVHcloud AI Endpoints

$1.01

9Alibaba (China)

$2.294

$6.881

10Alibaba Cloud / Qwen Teamプライマリ

$2.8

$8.4

11Alibaba

$2.8

$8.4

12LLM Gateway

$2.8

$8.4

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	98	45.0	LS
マルチモーダルランキング	59	73.0	LS
推論	79	55.0	LS