Qwen2.5-Omni-7B

Alibaba Cloud / Qwen TeamQwenOpen WeightApache 2.0 · Commercial OK

Description

Qwen2.5-Omni is the flagship end-to-end multimodal model in the Qwen series. It processes diverse inputs including text, images, audio, and video, delivering real-time streaming responses through text generation and natural speech synthesis using a novel Thinker-Talker architecture.

Release Date

2025-03-27

Parameters

7.0B

Context Length

33K

Modalities

audio, image, text, video

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Multimodal Ranking	57	74.0	LS

Benchmark Scores (LLM Stats)

Audio

VocalSound

93.9%SR

GiantSteps Tempo

88.0%SR

MMAU Music

69.2%SR

MMAU Sound

67.9%SR

MMAU

65.6%SR

MMAU Speech

59.8%SR

OmniBench Music

52.8%SR

CoVoST2 en-zh

0.41 / 100SR

MusicCaps

32.8%SR

Common Voice 15

0.08 / 100SR

Biology

GPQA

30.8%SR

Code

HumanEval

78.7%SR

Communication

VoiceBench Avg

74.1%SR

MM-MT-Bench

0.06 / 100SR

Creativity

Meld

57.0%SR

Finance

MMLU-Pro

47.0%SR

General

MBPP

0.73 / 100SR

MMLU-Redux

71.0%SR

MultiPL-E

65.8%SR

MMStar

64.0%SR

MME-RealWorld

61.6%SR

MMMU

59.2%SR

MMMU-Pro

36.6%SR

LiveBench

29.6%SR

NMOS

0.05 / 100SR

Grounding

PointGrounding

66.5%SR

Healthcare

CRPErelation

76.5%SR

Image To Text

DocVQA

95.2%SR

TextVQA

84.4%SR

OCRBench_V2

57.8%SR

Language

FLEURS

95.9%SR

Long Context

EgoSchema

68.6%SR

Math

GSM8k

88.7%SR

MATH

71.5%SR

MathVista

67.9%SR

MathVision

25.0%SR

Multimodal

ChartQA

85.3%SR

AI2D

83.2%SR

MMBench-V1.1

81.8%SR

VideoMME w sub.

72.4%SR

MVBench

70.3%SR

MuirBench

59.2%SR

OmniBench

56.1%SR

Spatial Reasoning

RealWorldQA

70.3%SR

Vision

ODinW

42.4%SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Speech To Text

100

Image To Text

Code

Language

Long Context

Spatial Reasoning

Video

Vision

Math

Multimodal

Reasoning

Legal

Finance

General

Healthcare

Physics

Biology

Chemistry

Communication

Pricing

Input Price$0.1 / 1M tokens

Output Price$0.4 / 1M tokens

Blended Price (3:1)$0.175 / 1M tokens

Speed

No speed data available

Provider Price Ranking

3 providers

Cheapest: Alibaba (China)Most Expensive: Alibaba

ProviderInputOutput

1Alibaba (China)Cheapest

$0.087

$0.345

2Alibaba Cloud / Qwen TeamPRIMARY

$0.1

$0.4

3Alibaba

$0.1

$0.4

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis