跳轉到主要內容

Qwen3.7-Plus

Alibaba Cloud / Qwen TeamQwenProprietary

描述

Qwen3.7-Plus is Alibaba Cloud Qwen Team's multimodal agent model that unifies vision and language into a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent—perceiving real-world scenes, reading screens and operating GUIs, writing code from visual references, navigating mobile apps end-to-end, and answering search-augmented visual questions—while blending GUI and CLI interactions within a single agent loop. It is a versatile coding agent and productivity assistant with full-modality input, generalizing across scaffolds such as Claude Code, OpenClaw, and Qwen Code. Features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. Available via Alibaba Cloud Model Studio (DashScope).

發布日期
2026-05-31
參數規模
上下文長度
1.0M
支援模態
image, text, video

能力雷達圖

53
general
70
coding
70
reasoning
60
science估算
70
agents
90
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
智慧體能力模型榜41
59.0
LS
多模態榜49
75.0
LS
推理能力70
58.0
LS

基準測試分數 (LLM Stats)

Agents

GDPval-AA946.00 / 3000自報
SpreadSheetBench-v186.3%自報
AndroidWorld81.0%自報
OSWorld-Verified73.3%自報
MCP Atlas73.2%自報
BFCL-V472.9%自報
Terminal-Bench 2.070.3%自報
CoWorkBench65.1%自報
Claw-Eval62.7%自報
DeepPlanning62.3%自報
QwenWorldBench62.1%自報
QwenClawBench61.8%自報
MCP-Mark58.7%自報
SWE-Bench Pro57.6%自報
ClawEval-MM55.7%自報
SkillsBench54.9%自報
VITA-Bench45.6%自報
MMSearch-Plus41.4%自報
NL2Repo41.1%自報
Finance Agent v238.2%自報

Biology

GPQA90.3%自報
SciCode51.3%自報

Chemistry

SuperGPQA71.4%自報

Code

SWE-Bench Verified77.7%自報
SWE-bench Multilingual75.8%自報

Finance

MMLU-Pro88.5%自報
MMLU-ProX85.4%自報

General

IFEval94.6%自報
MMLU-Redux94.5%自報
MRCR v291.7%自報
Global PIQA90.3%自報
LiveCodeBench v689.6%自報
MMMLU89.0%自報
MAXIFE88.8%自報
Include83.0%自報
SimpleVQA0.82 / 100自報
IFBench79.1%自報
MMMU-Pro79.0%自報
NOVA-6358.8%自報

Grounding

ScreenSpot Pro79.0%自報

Healthcare

VideoMMMU85.4%自報

Image To Text

OCRBench_V267.1%自報

Knowledge

MedXpertQA-MM71.0%自報
BC-VL51.1%自報
MMBC46.3%自報

Language

WMT24++84.6%自報
LingoQA83.4%自報

Long Context

MLVU87.4%自報
LVBench76.2%自報

Math

HMMT Feb 2692.9%自報
MathVision90.3%自報
IMO-AnswerBench86.0%自報
PolyMATH84.0%自報
Humanity's Last Exam34.7%自報
CritPT6.0%自報

Multimodal

OmniDocBench 1.591.4%自報
Video-MME88.0%自報
CharXiv-R85.9%自報
HiPhO84.1%自報
TVBench78.2%自報
VLADBench77.2%自報
SURDS77.2%自報
CountQA77.0%自報
BabyVision70.4%自報
WorldVQA61.1%自報
VisFactor42.8%自報

Reasoning

ERQA69.8%自報
Apex22.7%自報

Spatial Reasoning

RealWorldQA86.9%自報

Vision

ODinW51.1%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Legal
100
Finance
100
Agents
64
General
53
Reasoning
30
Structured Output
90
Instruction Following
90
Language
90
Long Context
90
Productivity
90
Video
90
Spatial Reasoning
80
Multimodal
80
Physics
80
Frontend Development
80
Grounding
80
Healthcare
80
Vision
80
Image To Text
70
Math
70
Biology
70
Chemistry
70
Code
70
Economics
70
Tool Calling
70
Coding
50

定價

輸入價格$0.5 / 1M tokens
輸出價格$3 / 1M tokens
混合價格(3:1)$1.125 / 1M tokens
快取讀取價格$0.05 / 1M tokens
快取寫入價格$0.625 / 1M tokens

速度

暫無速度資料

供應商價格排行

供應商價格排行

6 個供應商

最便宜: NanoGPT最貴: Alibaba (China)
供應商輸入輸出
1NanoGPT最便宜
$0.4
$1.6
2OpenCode Go
$0.4
$1.6
3LLM Gateway
$0.4
$1.6
4Alibaba Cloud / Qwen Team主要
$0.5
$3
5Alibaba
$0.5
$3
6Alibaba (China)
$0.5
$3

比較該模型在不同 API 供應商之間的定價。

外部連結