Qwen2.5 VL 72B Instruct

Alibaba Cloud / Qwen TeamQwen開源權重tongyi-qianwen

描述

Qwen2.5-VL is the new flagship vision-language model of Qwen, significantly improved from Qwen2-VL. It excels at recognizing objects, analyzing text/charts/layouts in images, acting as a visual agent, understanding long videos (over 1 hour) with event pinpointing, performing visual localization (bounding boxes/points), and generating structured outputs from documents.

發布日期

2025-01-26

參數規模

72.0B

上下文長度

131K

支援模態

image, text

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
智慧體能力模型榜	98	45.0	LS
多模態榜	59	73.0	LS
推理能力	79	55.0	LS

基準測試分數 (LLM Stats)

Agents

AITZ_EM

83.2%自報

MobileMiniWob++_SR

68.0%自報

AndroidWorld_SR

35.0%自報

OSWorld

8.8%自報

General

MMVet

76.2%自報

MLVU-M

74.6%自報

MMStar

70.8%自報

MMMU

70.2%自報

MMMU-Pro

51.1%自報

Grounding

ScreenSpot

87.1%自報

ScreenSpot Pro

43.6%自報

Image To Text

DocVQA

96.4%自報

OCRBench

88.5%自報

OCRBench-V2 (en)

61.5%自報

Long Context

EgoSchema

76.2%自報

LVBench

47.3%自報

Math

MathVista-Mini

74.8%自報

MathVision

38.1%自報

Multimodal

Android Control Low_EM

93.7%自報

ChartQA

89.5%自報

AI2D

88.4%自報

MMBench

88.0%自報

CC-OCR

79.8%自報

TempCompass

74.8%自報

VideoMME w/o sub.

73.3%自報

PerceptionTest

73.2%自報

MVBench

70.4%自報

Android Control High_EM

67.4%自報

MMBench-Video

2.0%自報

Reasoning

Hallusion Bench

55.2%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Image To Text

Structured Output

Text-to-image

Reasoning

Spatial Reasoning

Grounding

Healthcare

Long Context

Math

Multimodal

Vision

General

Video

Agents

定價

輸入價格$2.8 / 1M tokens

輸出價格$8.4 / 1M tokens

混合價格(3:1)$4.2 / 1M tokens

速度

暫無速度資料

供應商價格排行

12 個供應商

最便宜: Nebius Token Factory最貴: LLM Gateway

供應商輸入輸出

1Nebius Token Factory最便宜

$0.25

$0.75

2SiliconFlow (China)

$0.59

3SiliconFlow

$0.59

4NanoGPT

$0.69989

5OpenRouter

$0.8

6NovitaAI

$0.8

7Kilo Gateway

$0.8

8OVHcloud AI Endpoints

$1.01

9Alibaba (China)

$2.294

$6.881

10Alibaba Cloud / Qwen Team主要

$2.8

$8.4

11Alibaba

$2.8

$8.4

12LLM Gateway

$2.8

$8.4

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis