跳转到主要内容

Qwen3.7-Plus

Alibaba Cloud / Qwen TeamQwenProprietary

描述

Qwen3.7-Plus is Alibaba Cloud Qwen Team's multimodal agent model that unifies vision and language into a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent—perceiving real-world scenes, reading screens and operating GUIs, writing code from visual references, navigating mobile apps end-to-end, and answering search-augmented visual questions—while blending GUI and CLI interactions within a single agent loop. It is a versatile coding agent and productivity assistant with full-modality input, generalizing across scaffolds such as Claude Code, OpenClaw, and Qwen Code. Features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. Available via Alibaba Cloud Model Studio (DashScope).

发布日期
2026-05-31
参数规模
上下文长度
1.0M
支持模态
image, text, video

能力雷达图

53
general
70
coding
70
reasoning
60
science估算
70
agents
90
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
智能体能力模型榜41
59.0
LS
多模态榜49
75.0
LS
推理能力70
58.0
LS

基准测试分数 (LLM Stats)

Agents

GDPval-AA946.00 / 3000自报
SpreadSheetBench-v186.3%自报
AndroidWorld81.0%自报
OSWorld-Verified73.3%自报
MCP Atlas73.2%自报
BFCL-V472.9%自报
Terminal-Bench 2.070.3%自报
CoWorkBench65.1%自报
Claw-Eval62.7%自报
DeepPlanning62.3%自报
QwenWorldBench62.1%自报
QwenClawBench61.8%自报
MCP-Mark58.7%自报
SWE-Bench Pro57.6%自报
ClawEval-MM55.7%自报
SkillsBench54.9%自报
VITA-Bench45.6%自报
MMSearch-Plus41.4%自报
NL2Repo41.1%自报
Finance Agent v238.2%自报

Biology

GPQA90.3%自报
SciCode51.3%自报

Chemistry

SuperGPQA71.4%自报

Code

SWE-Bench Verified77.7%自报
SWE-bench Multilingual75.8%自报

Finance

MMLU-Pro88.5%自报
MMLU-ProX85.4%自报

General

IFEval94.6%自报
MMLU-Redux94.5%自报
MRCR v291.7%自报
Global PIQA90.3%自报
LiveCodeBench v689.6%自报
MMMLU89.0%自报
MAXIFE88.8%自报
Include83.0%自报
SimpleVQA0.82 / 100自报
IFBench79.1%自报
MMMU-Pro79.0%自报
NOVA-6358.8%自报

Grounding

ScreenSpot Pro79.0%自报

Healthcare

VideoMMMU85.4%自报

Image To Text

OCRBench_V267.1%自报

Knowledge

MedXpertQA-MM71.0%自报
BC-VL51.1%自报
MMBC46.3%自报

Language

WMT24++84.6%自报
LingoQA83.4%自报

Long Context

MLVU87.4%自报
LVBench76.2%自报

Math

HMMT Feb 2692.9%自报
MathVision90.3%自报
IMO-AnswerBench86.0%自报
PolyMATH84.0%自报
Humanity's Last Exam34.7%自报
CritPT6.0%自报

Multimodal

OmniDocBench 1.591.4%自报
Video-MME88.0%自报
CharXiv-R85.9%自报
HiPhO84.1%自报
TVBench78.2%自报
VLADBench77.2%自报
SURDS77.2%自报
CountQA77.0%自报
BabyVision70.4%自报
WorldVQA61.1%自报
VisFactor42.8%自报

Reasoning

ERQA69.8%自报
Apex22.7%自报

Spatial Reasoning

RealWorldQA86.9%自报

Vision

ODinW51.1%自报

AA 评测指数

暂无 AA 评测数据

LLM Stats 分类评分

Legal
100
Finance
100
Agents
64
General
53
Reasoning
30
Structured Output
90
Instruction Following
90
Language
90
Long Context
90
Productivity
90
Video
90
Spatial Reasoning
80
Multimodal
80
Physics
80
Frontend Development
80
Grounding
80
Healthcare
80
Vision
80
Image To Text
70
Math
70
Biology
70
Chemistry
70
Code
70
Economics
70
Tool Calling
70
Coding
50

定价

输入价格$0.5 / 1M tokens
输出价格$3 / 1M tokens
混合价格(3:1)$1.125 / 1M tokens
缓存读取价格$0.05 / 1M tokens
缓存写入价格$0.625 / 1M tokens

速度

暂无速度数据

供应商价格排行

供应商价格排行

6 个供应商

最便宜: NanoGPT最贵: Alibaba (China)
供应商输入输出
1NanoGPT最便宜
$0.4
$1.6
2OpenCode Go
$0.4
$1.6
3LLM Gateway
$0.4
$1.6
4Alibaba Cloud / Qwen Team主要
$0.5
$3
5Alibaba
$0.5
$3
6Alibaba (China)
$0.5
$3

比较该模型在不同 API 供应商之间的定价。

外部链接