Qwen3.7-Plus

Alibaba Cloud / Qwen TeamQwenProprietary

Description

Qwen3.7-Plus is Alibaba Cloud Qwen Team's multimodal agent model that unifies vision and language into a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent—perceiving real-world scenes, reading screens and operating GUIs, writing code from visual references, navigating mobile apps end-to-end, and answering search-augmented visual questions—while blending GUI and CLI interactions within a single agent loop. It is a versatile coding agent and productivity assistant with full-modality input, generalizing across scaffolds such as Claude Code, OpenClaw, and Qwen Code. Features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. Available via Alibaba Cloud Model Studio (DashScope).

Release Date

2026-05-31

Parameters

—

Context Length

1.0M

Modalities

image, text, video

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	41	59.0	LS
Multimodal Ranking	49	75.0	LS
Reasoning	70	58.0	LS

Benchmark Scores (LLM Stats)

Agents

GDPval-AA

946.00 / 3000SR

SpreadSheetBench-v1

86.3%SR

AndroidWorld

81.0%SR

OSWorld-Verified

73.3%SR

MCP Atlas

73.2%SR

BFCL-V4

72.9%SR

Terminal-Bench 2.0

70.3%SR

CoWorkBench

65.1%SR

Claw-Eval

62.7%SR

DeepPlanning

62.3%SR

QwenWorldBench

62.1%SR

QwenClawBench

61.8%SR

MCP-Mark

58.7%SR

SWE-Bench Pro

57.6%SR

ClawEval-MM

55.7%SR

SkillsBench

54.9%SR

VITA-Bench

45.6%SR

MMSearch-Plus

41.4%SR

NL2Repo

41.1%SR

Finance Agent v2

38.2%SR

Biology

GPQA

90.3%SR

SciCode

51.3%SR

Chemistry

SuperGPQA

71.4%SR

Code

SWE-Bench Verified

77.7%SR

SWE-bench Multilingual

75.8%SR

Finance

MMLU-Pro

88.5%SR

MMLU-ProX

85.4%SR

General

IFEval

94.6%SR

MMLU-Redux

94.5%SR

MRCR v2

91.7%SR

Global PIQA

90.3%SR

LiveCodeBench v6

89.6%SR

MMMLU

89.0%SR

MAXIFE

88.8%SR

Include

83.0%SR

SimpleVQA

0.82 / 100SR

IFBench

79.1%SR

MMMU-Pro

79.0%SR

NOVA-63

58.8%SR

Grounding

ScreenSpot Pro

79.0%SR

Healthcare

VideoMMMU

85.4%SR

Image To Text

OCRBench_V2

67.1%SR

Knowledge

MedXpertQA-MM

71.0%SR

BC-VL

51.1%SR

MMBC

46.3%SR

Language

WMT24++

84.6%SR

LingoQA

83.4%SR

Long Context

MLVU

87.4%SR

LVBench

76.2%SR

Math

HMMT Feb 26

92.9%SR

MathVision

90.3%SR

IMO-AnswerBench

86.0%SR

PolyMATH

84.0%SR

Humanity's Last Exam

34.7%SR

CritPT

6.0%SR

Multimodal

OmniDocBench 1.5

91.4%SR

Video-MME

88.0%SR

CharXiv-R

85.9%SR

HiPhO

84.1%SR

TVBench

78.2%SR

VLADBench

77.2%SR

SURDS

77.2%SR

CountQA

77.0%SR

BabyVision

70.4%SR

WorldVQA

61.1%SR

VisFactor

42.8%SR

Reasoning

ERQA

69.8%SR

Apex

22.7%SR

Spatial Reasoning

RealWorldQA

86.9%SR

Vision

ODinW

51.1%SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Legal

100

Finance

100

Agents

General

Reasoning

Structured Output

Instruction Following

Language

Long Context

Productivity

Video

Spatial Reasoning

Multimodal

Physics

Frontend Development

Grounding

Healthcare

Vision

Image To Text

Math

Biology

Chemistry

Code

Economics

Tool Calling

Coding

Pricing

Input Price$0.5 / 1M tokens

Output Price$3 / 1M tokens

Blended Price (3:1)$1.125 / 1M tokens

Cache Read Price$0.05 / 1M tokens

Cache Write Price$0.625 / 1M tokens

Speed

No speed data available

Provider Price Ranking

6 providers

Cheapest: NanoGPTMost Expensive: Alibaba (China)

ProviderInputOutput

1NanoGPTCheapest

$0.4

$1.6

2OpenCode Go

$0.4

$1.6

3LLM Gateway

$0.4

$1.6

4Alibaba Cloud / Qwen TeamPRIMARY

$0.5

5Alibaba

$0.5

6Alibaba (China)

$0.5

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis