Qwen3.7-Plus
Description
Qwen3.7-Plus is a multimodal agent model from Alibaba Cloud's Qwen Team that unifies vision and language in a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent that blends GUI and CLI interactions within a single agent loop: it perceives real-world scenes, reads screens and operates GUIs, writes code from visual references, navigates mobile apps end-to-end, and answers search-augmented visual questions. It serves as a coding agent and productivity assistant with full-modality input and generalizes across scaffolds such as Claude Code, OpenClaw, and Qwen Code. It features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. It is available via Alibaba Cloud Model Studio (DashScope).
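The context and output limits quoted above can be turned into a simple budgeting check before sending a request. The sketch below is illustrative only: the helper name `plan_output_budget` is hypothetical, "1 million" is taken as exactly 1,000,000 tokens, and it assumes that prompt and output tokens share the same context window, which the description does not confirm.

```python
# Limits quoted in the model description.
# Assumption: "1 million" means exactly 1,000,000 tokens.
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 65_536


def plan_output_budget(prompt_tokens: int, requested_output_tokens: int) -> int:
    """Return an output-token budget that fits the stated limits.

    Assumption (not confirmed by the source): prompt and output tokens
    count against the same context window.
    """
    if prompt_tokens < 0 or requested_output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if prompt_tokens >= CONTEXT_WINDOW_TOKENS:
        raise ValueError("prompt alone fills or exceeds the context window")
    remaining = CONTEXT_WINDOW_TOKENS - prompt_tokens
    # The budget is capped by the request, the output limit, and the space left.
    return min(requested_output_tokens, MAX_OUTPUT_TOKENS, remaining)
```

Under these assumptions, a 10,000-token prompt asking for 100,000 output tokens is capped at the 65,536-token output limit, while a 990,000-token prompt leaves room for only 10,000 output tokens.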
Capability Radar
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 41 | 59.0 | LS |
| Multimodal Ranking | 49 | 75.0 | LS |
| Reasoning | 70 | 58.0 | LS |
Benchmark Scores (LLM Stats)
Categories: Agents, Biology, Chemistry, Code, Finance, General, Grounding, Healthcare, Image To Text, Knowledge, Language, Long Context, Math, Multimodal, Reasoning, Spatial Reasoning, Vision
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Pricing
Speed
No speed data available
Provider Price Ranking
6 providers
Compare pricing across different API providers for this model.