Qwen2.5 VL 72B Instruct

Alibaba Cloud / Qwen Team · Qwen · Open weights · tongyi-qianwen

Description

Qwen2.5-VL is the new flagship vision-language model of Qwen, significantly improved from Qwen2-VL. It excels at recognizing objects, analyzing text/charts/layouts in images, acting as a visual agent, understanding long videos (over 1 hour) with event pinpointing, performing visual localization (bounding boxes/points), and generating structured outputs from documents.

Release date

2025-01-26

Parameters

72.0B

Context length

131K

Supported modalities

image, text

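Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal.

The science score is estimated using reasoning ability as a proxy when no dedicated science evaluation is available.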

Leaderboard rankings

Category	Rank	Score	Source
Agent capability leaderboard	98	45.0	LS
Multimodal leaderboard	59	73.0	LS
Reasoning	79	55.0	LS

Benchmark scores (LLM Stats)

Agents

Benchmark	Score	Source
AITZ_EM	83.2%	self-reported
MobileMiniWob++_SR	68.0%	self-reported
AndroidWorld_SR	35.0%	self-reported
OSWorld	8.8%	self-reported

General

Benchmark	Score	Source
MMVet	76.2%	self-reported
MLVU-M	74.6%	self-reported
MMStar	70.8%	self-reported
MMMU	70.2%	self-reported
MMMU-Pro	51.1%	self-reported

Grounding

Benchmark	Score	Source
ScreenSpot	87.1%	self-reported
ScreenSpot Pro	43.6%	self-reported

Image To Text

Benchmark	Score	Source
DocVQA	96.4%	self-reported
OCRBench	88.5%	self-reported
OCRBench-V2 (en)	61.5%	self-reported

Long Context

Benchmark	Score	Source
EgoSchema	76.2%	self-reported
LVBench	47.3%	self-reported

Math

Benchmark	Score	Source
MathVista-Mini	74.8%	self-reported
MathVision	38.1%	self-reported

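Multimodal

Benchmark	Score	Source
Android Control Low_EM	93.7%	self-reported
ChartQA	89.5%	self-reported
AI2D	88.4%	self-reported
MMBench	88.0%	self-reported
CC-OCR	79.8%	self-reported
TempCompass	74.8%	self-reported
VideoMME w/o sub.	73.3%	self-reported
PerceptionTest	73.2%	self-reported
MVBench	70.4%	self-reported
Android Control High_EM	67.4%	self-reported
MMBench-Video	2.0%	self-reported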

Reasoning

Benchmark	Score	Source
Hallusion Bench	55.2%	self-reported

AA (Artificial Analysis) evaluation index

No AA evaluation data available.

LLM Stats category scores

Categories: Image To Text, Structured Output, Text-to-image, Reasoning, Spatial Reasoning, Grounding, Healthcare, Long Context, Math, Multimodal, Vision, General, Video, Agents.

Pricing

Input price: $2.8 / 1M tokens

Output price: $8.4 / 1M tokens

Blended price (3:1): $4.2 / 1M tokens
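The blended figure is consistent with a weighted average of the input and output prices. The sketch below reproduces it, assuming that "3:1" means three input tokens for every output token; the function name `blended_price` and its parameters are illustrative and not taken from the page.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average price in USD per 1M tokens.

    Assumption: the ratio weights input tokens against output tokens
    (3 input : 1 output by default).
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices for this model: $2.8 input, $8.4 output per 1M tokens.
price = blended_price(2.8, 8.4)
print(f"${price:.2f} / 1M tokens")
```

With the listed prices, (3 × 2.8 + 1 × 8.4) / 4 = 4.2, which matches the blended price shown on the page.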

Speed

No speed data available.

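Provider price ranking

12 providers

Cheapest: Nebius Token Factory. Most expensive: LLM Gateway.

Rank	Provider	Input (per 1M tokens)	Output (per 1M tokens)
1	Nebius Token Factory (cheapest)	$0.25	$0.75
2	SiliconFlow (China)	$0.59	not listed
3	SiliconFlow	$0.59	not listed
4	NanoGPT	$0.69989	not listed
5	OpenRouter	$0.8	not listed
6	NovitaAI	$0.8	not listed
7	Kilo Gateway	$0.8	not listed
8	OVHcloud AI Endpoints	$1.01	not listed
9	Alibaba (China)	$2.294	$6.881
10	Alibaba Cloud / Qwen Team (primary)	$2.8	$8.4
11	Alibaba	$2.8	$8.4
12	LLM Gateway	$2.8	$8.4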

Compare this model's pricing across API providers.
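For providers that list two prices, the cost of a given workload can be compared directly. The following is a minimal sketch, assuming the two figures are input and output prices in USD per 1M tokens; the workload of 3M input and 1M output tokens and the names `PROVIDERS`, `request_cost`, and `rank_by_cost` are hypothetical and chosen only for illustration.

```python
# (input price, output price) in USD per 1M tokens, copied from the ranking above.
# Only providers with both prices listed are included.
PROVIDERS = {
    "Nebius Token Factory": (0.25, 0.75),
    "Alibaba (China)": (2.294, 6.881),
    "Alibaba Cloud / Qwen Team": (2.8, 8.4),
}


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD for a workload, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_by_cost(input_tokens: int, output_tokens: int) -> list[tuple[str, float]]:
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = {
        name: request_cost(input_tokens, output_tokens, in_price, out_price)
        for name, (in_price, out_price) in PROVIDERS.items()
    }
    return sorted(costs.items(), key=lambda item: item[1])


# Hypothetical workload: 3M input tokens and 1M output tokens.
for name, cost in rank_by_cost(3_000_000, 1_000_000):
    print(f"{name}: ${cost:.2f}")
```

For this hypothetical workload the cheapest listed provider comes to about $1.50 and the primary provider to about $16.80, so the ordering matches the ranking by input price.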

External links

LLM Stats · Artificial Analysis