Qwen3.7-Plus

Alibaba Cloud / Qwen Team · Qwen · Proprietary

Description

Qwen3.7-Plus is the Alibaba Cloud Qwen Team's multimodal agent model, unifying vision and language in a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal, interactive hybrid agent: it perceives real-world scenes, reads screens and operates GUIs, writes code from visual references, navigates mobile apps end-to-end, and answers search-augmented visual questions, blending GUI and CLI interactions within a single agent loop. It serves as a coding agent and productivity assistant with full-modality input and generalizes across scaffolds such as Claude Code, OpenClaw, and Qwen Code. It features a 1-million-token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. It is available via Alibaba Cloud Model Studio (DashScope).
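The context and output limits quoted above can be turned into a quick budget check before sending a request. The following is a minimal sketch, not the provider's accounting: it assumes "1 million" means exactly 1,000,000 tokens, that prompt and output share the context window, and it does not model how always-on thinking tokens are counted. The function name `output_budget` is hypothetical.

```python
# Limits quoted in the description. The exact token count behind
# "1 million" is an assumption (1,000,000 rather than 1,048,576).
CONTEXT_WINDOW = 1_000_000
MAX_OUTPUT_TOKENS = 65_536


def output_budget(prompt_tokens: int, requested_output: int = MAX_OUTPUT_TOKENS) -> int:
    """Return how many output tokens can be requested for a given prompt size.

    Assumes (hypothetically) that prompt and output share the context window.
    Raises ValueError if the prompt alone does not fit.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    if prompt_tokens >= CONTEXT_WINDOW:
        raise ValueError("prompt does not fit in the context window")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # The request is capped by both the per-response limit and the space left.
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)
```

For example, a 990,000-token prompt leaves room for at most 10,000 output tokens under these assumptions, while a 10,000-token prompt is capped by the 65,536-token output limit.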

Release Date: 2026-05-31
Parameters: (not listed)
Context Length: 1.0M tokens
Modalities: image, text, video

Capability Radar

general: 53
coding: 70
reasoning: 70
science (est.): 60
agents: 70
multimodal: 90

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain              Rank  Score  Source
Agentic Capability  #41   59.0   LS
Multimodal Ranking  #49   75.0   LS
Reasoning           #70   58.0   LS

Benchmark Scores (LLM Stats)

Agents

GDPval-AA: 946.00 / 3000 (SR)
SpreadSheetBench-v1: 86.3% (SR)
AndroidWorld: 81.0% (SR)
OSWorld-Verified: 73.3% (SR)
MCP Atlas: 73.2% (SR)
BFCL-V4: 72.9% (SR)
Terminal-Bench 2.0: 70.3% (SR)
CoWorkBench: 65.1% (SR)
Claw-Eval: 62.7% (SR)
DeepPlanning: 62.3% (SR)
QwenWorldBench: 62.1% (SR)
QwenClawBench: 61.8% (SR)
MCP-Mark: 58.7% (SR)
SWE-Bench Pro: 57.6% (SR)
ClawEval-MM: 55.7% (SR)
SkillsBench: 54.9% (SR)
VITA-Bench: 45.6% (SR)
MMSearch-Plus: 41.4% (SR)
NL2Repo: 41.1% (SR)
Finance Agent v2: 38.2% (SR)

Biology

GPQA: 90.3% (SR)
SciCode: 51.3% (SR)

Chemistry

SuperGPQA: 71.4% (SR)

Code

SWE-Bench Verified: 77.7% (SR)
SWE-bench Multilingual: 75.8% (SR)

Finance

MMLU-Pro: 88.5% (SR)
MMLU-ProX: 85.4% (SR)

General

IFEval: 94.6% (SR)
MMLU-Redux: 94.5% (SR)
MRCR v2: 91.7% (SR)
Global PIQA: 90.3% (SR)
LiveCodeBench v6: 89.6% (SR)
MMMLU: 89.0% (SR)
MAXIFE: 88.8% (SR)
Include: 83.0% (SR)
SimpleVQA: 0.82 / 100 (SR)
IFBench: 79.1% (SR)
MMMU-Pro: 79.0% (SR)
NOVA-63: 58.8% (SR)

Grounding

ScreenSpot Pro: 79.0% (SR)

Healthcare

VideoMMMU: 85.4% (SR)

Image To Text

OCRBench_V2: 67.1% (SR)

Knowledge

MedXpertQA-MM: 71.0% (SR)
BC-VL: 51.1% (SR)
MMBC: 46.3% (SR)

Language

WMT24++: 84.6% (SR)
LingoQA: 83.4% (SR)

Long Context

MLVU: 87.4% (SR)
LVBench: 76.2% (SR)

Math

HMMT Feb 26: 92.9% (SR)
MathVision: 90.3% (SR)
IMO-AnswerBench: 86.0% (SR)
PolyMATH: 84.0% (SR)
Humanity's Last Exam: 34.7% (SR)
CritPT: 6.0% (SR)

Multimodal

OmniDocBench 1.5: 91.4% (SR)
Video-MME: 88.0% (SR)
CharXiv-R: 85.9% (SR)
HiPhO: 84.1% (SR)
TVBench: 78.2% (SR)
VLADBench: 77.2% (SR)
SURDS: 77.2% (SR)
CountQA: 77.0% (SR)
BabyVision: 70.4% (SR)
WorldVQA: 61.1% (SR)
VisFactor: 42.8% (SR)

Reasoning

ERQA: 69.8% (SR)
Apex: 22.7% (SR)

Spatial Reasoning

RealWorldQA: 86.9% (SR)

Vision

ODinW: 51.1% (SR)

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Legal: 100
Finance: 100
Agents: 64
General: 53
Reasoning: 30
Structured Output: 90
Instruction Following: 90
Language: 90
Long Context: 90
Productivity: 90
Video: 90
Spatial Reasoning: 80
Multimodal: 80
Physics: 80
Frontend Development: 80
Grounding: 80
Healthcare: 80
Vision: 80
Image To Text: 70
Math: 70
Biology: 70
Chemistry: 70
Code: 70
Economics: 70
Tool Calling: 70
Coding: 50

Pricing

Input Price: $0.5 / 1M tokens
Output Price: $3 / 1M tokens
Blended Price (3:1): $1.125 / 1M tokens
Cache Read Price: $0.05 / 1M tokens
Cache Write Price: $0.625 / 1M tokens
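The blended figure is consistent with a weighted average of three input tokens per output token: (3 × 0.5 + 1 × 3) / 4 = 1.125. The sketch below reproduces that arithmetic and estimates the cost of a single request. The function names are hypothetical, and treating cached tokens as a subset of the input billed at the cache-read rate is an assumption rather than documented billing behavior; cache-write charges are not modeled.

```python
# Prices in USD per 1M tokens, copied from the Pricing section.
INPUT_PRICE = 0.5
OUTPUT_PRICE = 3.0
CACHE_READ_PRICE = 0.05


def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price, assuming `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1)


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimated USD cost of one request.

    Assumption: cached_tokens are part of input_tokens and are billed at the
    cache-read rate instead of the regular input rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / 1_000_000


print(blended_price(INPUT_PRICE, OUTPUT_PRICE))  # 1.125
```

Under these assumptions, a request with 300,000 input tokens and 100,000 output tokens costs about $0.45, and about $0.36 if 200,000 of the input tokens are served from cache.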

Speed

No speed data available

Provider Price Ranking

6 providers

Cheapest: NanoGPT
Most Expensive: Alibaba (China)

Rank  Provider                             Input ($ / 1M tokens)  Output ($ / 1M tokens)
1     NanoGPT (Cheapest)                   $0.4                   $1.6
2     OpenCode Go                          $0.4                   $1.6
3     LLM Gateway                          $0.4                   $1.6
4     Alibaba Cloud / Qwen Team (PRIMARY)  $0.5                   $3
5     Alibaba                              $0.5                   $3
6     Alibaba (China)                      $0.5                   $3
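To compare providers on a single number, the listed input and output prices can be collapsed into a blended price and sorted. This is a minimal sketch that assumes the provider prices are in USD per 1M tokens and reuses the 3:1 input-to-output weighting from the Pricing section. The helper names are hypothetical, and the page's own ranking method is not stated.

```python
# (provider, input price, output price), copied from the provider list.
# Assumption: prices are USD per 1M tokens, as in the Pricing section.
PROVIDERS = [
    ("NanoGPT", 0.4, 1.6),
    ("OpenCode Go", 0.4, 1.6),
    ("LLM Gateway", 0.4, 1.6),
    ("Alibaba Cloud / Qwen Team", 0.5, 3.0),
    ("Alibaba", 0.5, 3.0),
    ("Alibaba (China)", 0.5, 3.0),
]


def blended(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price, assuming `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1)


def rank_by_blended(rows, ratio: float = 3.0):
    """Return (provider, blended price) pairs, cheapest first.

    sorted() is stable, so providers with equal prices keep their listed order.
    """
    priced = [(name, blended(inp, out, ratio)) for name, inp, out in rows]
    return sorted(priced, key=lambda pair: pair[1])


for name, price in rank_by_blended(PROVIDERS):
    print(f"{name}: ${price:.3f} / 1M tokens (blended 3:1)")
```

With these inputs the three cheapest providers tie at a blended price of about $0.70 per 1M tokens, against $1.125 for the three Alibaba listings.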

Compare pricing across different API providers for this model.

External Sources