Saltar al contenido principal

Qwen3.7-Plus

Alibaba Cloud / Qwen TeamQwenProprietary

Descripción

Qwen3.7-Plus is Alibaba Cloud Qwen Team's multimodal agent model that unifies vision and language into a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent—perceiving real-world scenes, reading screens and operating GUIs, writing code from visual references, navigating mobile apps end-to-end, and answering search-augmented visual questions—while blending GUI and CLI interactions within a single agent loop. It is a versatile coding agent and productivity assistant with full-modality input, generalizing across scaffolds such as Claude Code, OpenClaw, and Qwen Code. Features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. Available via Alibaba Cloud Model Studio (DashScope).

Fecha de lanzamiento
2026-05-31
Parámetros
Longitud del contexto
1.0M
Modalidades
image, text, video

Radar de capacidades

53
general
70
coding
70
reasoning
60
scienceest.
70
agents
90
multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio#PosiciónPuntuaciónFuente
Capacidad agéntica41
59.0
LS
Ranking multimodal49
75.0
LS
Razonamiento70
58.0
LS

Puntuaciones de benchmarks (LLM Stats)

Agents

GDPval-AA946.00 / 3000Aut.
SpreadSheetBench-v186.3%Aut.
AndroidWorld81.0%Aut.
OSWorld-Verified73.3%Aut.
MCP Atlas73.2%Aut.
BFCL-V472.9%Aut.
Terminal-Bench 2.070.3%Aut.
CoWorkBench65.1%Aut.
Claw-Eval62.7%Aut.
DeepPlanning62.3%Aut.
QwenWorldBench62.1%Aut.
QwenClawBench61.8%Aut.
MCP-Mark58.7%Aut.
SWE-Bench Pro57.6%Aut.
ClawEval-MM55.7%Aut.
SkillsBench54.9%Aut.
VITA-Bench45.6%Aut.
MMSearch-Plus41.4%Aut.
NL2Repo41.1%Aut.
Finance Agent v238.2%Aut.

Biology

GPQA90.3%Aut.
SciCode51.3%Aut.

Chemistry

SuperGPQA71.4%Aut.

Code

SWE-Bench Verified77.7%Aut.
SWE-bench Multilingual75.8%Aut.

Finance

MMLU-Pro88.5%Aut.
MMLU-ProX85.4%Aut.

General

IFEval94.6%Aut.
MMLU-Redux94.5%Aut.
MRCR v291.7%Aut.
Global PIQA90.3%Aut.
LiveCodeBench v689.6%Aut.
MMMLU89.0%Aut.
MAXIFE88.8%Aut.
Include83.0%Aut.
SimpleVQA0.82 / 100Aut.
IFBench79.1%Aut.
MMMU-Pro79.0%Aut.
NOVA-6358.8%Aut.

Grounding

ScreenSpot Pro79.0%Aut.

Healthcare

VideoMMMU85.4%Aut.

Image To Text

OCRBench_V267.1%Aut.

Knowledge

MedXpertQA-MM71.0%Aut.
BC-VL51.1%Aut.
MMBC46.3%Aut.

Language

WMT24++84.6%Aut.
LingoQA83.4%Aut.

Long Context

MLVU87.4%Aut.
LVBench76.2%Aut.

Math

HMMT Feb 2692.9%Aut.
MathVision90.3%Aut.
IMO-AnswerBench86.0%Aut.
PolyMATH84.0%Aut.
Humanity's Last Exam34.7%Aut.
CritPT6.0%Aut.

Multimodal

OmniDocBench 1.591.4%Aut.
Video-MME88.0%Aut.
CharXiv-R85.9%Aut.
HiPhO84.1%Aut.
TVBench78.2%Aut.
VLADBench77.2%Aut.
SURDS77.2%Aut.
CountQA77.0%Aut.
BabyVision70.4%Aut.
WorldVQA61.1%Aut.
VisFactor42.8%Aut.

Reasoning

ERQA69.8%Aut.
Apex22.7%Aut.

Spatial Reasoning

RealWorldQA86.9%Aut.

Vision

ODinW51.1%Aut.

Índices de evaluación AA

No hay datos de evaluación AA disponibles

Puntuaciones por categoría LLM Stats

Legal
100
Finance
100
Agents
64
General
53
Reasoning
30
Structured Output
90
Instruction Following
90
Language
90
Long Context
90
Productivity
90
Video
90
Spatial Reasoning
80
Multimodal
80
Physics
80
Frontend Development
80
Grounding
80
Healthcare
80
Vision
80
Image To Text
70
Math
70
Biology
70
Chemistry
70
Code
70
Economics
70
Tool Calling
70
Coding
50

Precios

Precio de entrada$0.5 / 1M tokens
Precio de salida$3 / 1M tokens
Precio mixto (3:1)$1.125 / 1M tokens
Precio de lectura caché$0.05 / 1M tokens
Precio de escritura caché$0.625 / 1M tokens

Velocidad

No hay datos de velocidad disponibles

Ranking de Precios por Proveedor

Ranking de Precios por Proveedor

6 proveedores

Más barato: NanoGPTMás caro: Alibaba (China)
ProveedorEntradaSalida
1NanoGPTMás barato
$0.4
$1.6
2OpenCode Go
$0.4
$1.6
3LLM Gateway
$0.4
$1.6
4Alibaba Cloud / Qwen TeamPRINCIPAL
$0.5
$3
5Alibaba
$0.5
$3
6Alibaba (China)
$0.5
$3

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas