Passer au contenu principal

Qwen3.7-Plus

Alibaba Cloud / Qwen TeamQwenProprietary

Description

Qwen3.7-Plus is Alibaba Cloud Qwen Team's multimodal agent model that unifies vision and language into a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent—perceiving real-world scenes, reading screens and operating GUIs, writing code from visual references, navigating mobile apps end-to-end, and answering search-augmented visual questions—while blending GUI and CLI interactions within a single agent loop. It is a versatile coding agent and productivity assistant with full-modality input, generalizing across scaffolds such as Claude Code, OpenClaw, and Qwen Code. Features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. Available via Alibaba Cloud Model Studio (DashScope).

Date de sortie
2026-05-31
Paramètres
Longueur du contexte
1.0M
Modalités
image, text, video

Radar de capacités

53
general
70
coding
70
reasoning
60
scienceest.
70
agents
90
multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine#RangScoreSource
Capacité agentique41
59.0
LS
Classement multimodal49
75.0
LS
Raisonnement70
58.0
LS

Scores de benchmarks (LLM Stats)

Agents

GDPval-AA946.00 / 3000Aut.
SpreadSheetBench-v186.3%Aut.
AndroidWorld81.0%Aut.
OSWorld-Verified73.3%Aut.
MCP Atlas73.2%Aut.
BFCL-V472.9%Aut.
Terminal-Bench 2.070.3%Aut.
CoWorkBench65.1%Aut.
Claw-Eval62.7%Aut.
DeepPlanning62.3%Aut.
QwenWorldBench62.1%Aut.
QwenClawBench61.8%Aut.
MCP-Mark58.7%Aut.
SWE-Bench Pro57.6%Aut.
ClawEval-MM55.7%Aut.
SkillsBench54.9%Aut.
VITA-Bench45.6%Aut.
MMSearch-Plus41.4%Aut.
NL2Repo41.1%Aut.
Finance Agent v238.2%Aut.

Biology

GPQA90.3%Aut.
SciCode51.3%Aut.

Chemistry

SuperGPQA71.4%Aut.

Code

SWE-Bench Verified77.7%Aut.
SWE-bench Multilingual75.8%Aut.

Finance

MMLU-Pro88.5%Aut.
MMLU-ProX85.4%Aut.

General

IFEval94.6%Aut.
MMLU-Redux94.5%Aut.
MRCR v291.7%Aut.
Global PIQA90.3%Aut.
LiveCodeBench v689.6%Aut.
MMMLU89.0%Aut.
MAXIFE88.8%Aut.
Include83.0%Aut.
SimpleVQA0.82 / 100Aut.
IFBench79.1%Aut.
MMMU-Pro79.0%Aut.
NOVA-6358.8%Aut.

Grounding

ScreenSpot Pro79.0%Aut.

Healthcare

VideoMMMU85.4%Aut.

Image To Text

OCRBench_V267.1%Aut.

Knowledge

MedXpertQA-MM71.0%Aut.
BC-VL51.1%Aut.
MMBC46.3%Aut.

Language

WMT24++84.6%Aut.
LingoQA83.4%Aut.

Long Context

MLVU87.4%Aut.
LVBench76.2%Aut.

Math

HMMT Feb 2692.9%Aut.
MathVision90.3%Aut.
IMO-AnswerBench86.0%Aut.
PolyMATH84.0%Aut.
Humanity's Last Exam34.7%Aut.
CritPT6.0%Aut.

Multimodal

OmniDocBench 1.591.4%Aut.
Video-MME88.0%Aut.
CharXiv-R85.9%Aut.
HiPhO84.1%Aut.
TVBench78.2%Aut.
VLADBench77.2%Aut.
SURDS77.2%Aut.
CountQA77.0%Aut.
BabyVision70.4%Aut.
WorldVQA61.1%Aut.
VisFactor42.8%Aut.

Reasoning

ERQA69.8%Aut.
Apex22.7%Aut.

Spatial Reasoning

RealWorldQA86.9%Aut.

Vision

ODinW51.1%Aut.

Indices d'évaluation AA

Aucune donnée d'évaluation AA disponible

Scores par catégorie LLM Stats

Legal
100
Finance
100
Agents
64
General
53
Reasoning
30
Structured Output
90
Instruction Following
90
Language
90
Long Context
90
Productivity
90
Video
90
Spatial Reasoning
80
Multimodal
80
Physics
80
Frontend Development
80
Grounding
80
Healthcare
80
Vision
80
Image To Text
70
Math
70
Biology
70
Chemistry
70
Code
70
Economics
70
Tool Calling
70
Coding
50

Tarification

Prix d'entrée$0.5 / 1M tokens
Prix de sortie$3 / 1M tokens
Prix mixte (3:1)$1.125 / 1M tokens
Prix de lecture cache$0.05 / 1M tokens
Prix d'écriture cache$0.625 / 1M tokens

Vitesse

Aucune donnée de vitesse disponible

Classement des Prix par Fournisseur

Classement des Prix par Fournisseur

6 fournisseurs

Moins cher: NanoGPTPlus cher: Alibaba (China)
FournisseurEntréeSortie
1NanoGPTMoins cher
$0.4
$1.6
2OpenCode Go
$0.4
$1.6
3LLM Gateway
$0.4
$1.6
4Alibaba Cloud / Qwen TeamPRINCIPAL
$0.5
$3
5Alibaba
$0.5
$3
6Alibaba (China)
$0.5
$3

Comparer les prix entre différents fournisseurs API pour ce modèle.

Sources externes