Перейти к основному содержанию

Qwen3.7-Plus

Alibaba Cloud / Qwen TeamQwenProprietary

Описание

Qwen3.7-Plus is Alibaba Cloud Qwen Team's multimodal agent model that unifies vision and language into a single agent foundation. Built on the Qwen3.7 text backbone, it operates as a multimodal interactive hybrid agent—perceiving real-world scenes, reading screens and operating GUIs, writing code from visual references, navigating mobile apps end-to-end, and answering search-augmented visual questions—while blending GUI and CLI interactions within a single agent loop. It is a versatile coding agent and productivity assistant with full-modality input, generalizing across scaffolds such as Claude Code, OpenClaw, and Qwen Code. Features a 1 million token context window, up to 65,536 output tokens, always-on thinking, and a preserve_thinking mode for agentic tasks. Available via Alibaba Cloud Model Studio (DashScope).

Дата выхода
2026-05-31
Параметры
Длина контекста
1.0M
Модальности
image, text, video

Радар способностей

53
general
70
coding
70
reasoning
60
scienceоцен.
70
agents
90
multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен#МестоОценкаИсточник
Агентные возможности41
59.0
LS
Мультимодальный рейтинг49
75.0
LS
Рассуждения70
58.0
LS

Оценки бенчмарков (LLM Stats)

Agents

GDPval-AA946.00 / 3000Сам.
SpreadSheetBench-v186.3%Сам.
AndroidWorld81.0%Сам.
OSWorld-Verified73.3%Сам.
MCP Atlas73.2%Сам.
BFCL-V472.9%Сам.
Terminal-Bench 2.070.3%Сам.
CoWorkBench65.1%Сам.
Claw-Eval62.7%Сам.
DeepPlanning62.3%Сам.
QwenWorldBench62.1%Сам.
QwenClawBench61.8%Сам.
MCP-Mark58.7%Сам.
SWE-Bench Pro57.6%Сам.
ClawEval-MM55.7%Сам.
SkillsBench54.9%Сам.
VITA-Bench45.6%Сам.
MMSearch-Plus41.4%Сам.
NL2Repo41.1%Сам.
Finance Agent v238.2%Сам.

Biology

GPQA90.3%Сам.
SciCode51.3%Сам.

Chemistry

SuperGPQA71.4%Сам.

Code

SWE-Bench Verified77.7%Сам.
SWE-bench Multilingual75.8%Сам.

Finance

MMLU-Pro88.5%Сам.
MMLU-ProX85.4%Сам.

General

IFEval94.6%Сам.
MMLU-Redux94.5%Сам.
MRCR v291.7%Сам.
Global PIQA90.3%Сам.
LiveCodeBench v689.6%Сам.
MMMLU89.0%Сам.
MAXIFE88.8%Сам.
Include83.0%Сам.
SimpleVQA0.82 / 100Сам.
IFBench79.1%Сам.
MMMU-Pro79.0%Сам.
NOVA-6358.8%Сам.

Grounding

ScreenSpot Pro79.0%Сам.

Healthcare

VideoMMMU85.4%Сам.

Image To Text

OCRBench_V267.1%Сам.

Knowledge

MedXpertQA-MM71.0%Сам.
BC-VL51.1%Сам.
MMBC46.3%Сам.

Language

WMT24++84.6%Сам.
LingoQA83.4%Сам.

Long Context

MLVU87.4%Сам.
LVBench76.2%Сам.

Math

HMMT Feb 2692.9%Сам.
MathVision90.3%Сам.
IMO-AnswerBench86.0%Сам.
PolyMATH84.0%Сам.
Humanity's Last Exam34.7%Сам.
CritPT6.0%Сам.

Multimodal

OmniDocBench 1.591.4%Сам.
Video-MME88.0%Сам.
CharXiv-R85.9%Сам.
HiPhO84.1%Сам.
TVBench78.2%Сам.
VLADBench77.2%Сам.
SURDS77.2%Сам.
CountQA77.0%Сам.
BabyVision70.4%Сам.
WorldVQA61.1%Сам.
VisFactor42.8%Сам.

Reasoning

ERQA69.8%Сам.
Apex22.7%Сам.

Spatial Reasoning

RealWorldQA86.9%Сам.

Vision

ODinW51.1%Сам.

Индексы оценки AA

Нет данных AA оценки

Оценки категорий LLM Stats

Legal
100
Finance
100
Agents
64
General
53
Reasoning
30
Structured Output
90
Instruction Following
90
Language
90
Long Context
90
Productivity
90
Video
90
Spatial Reasoning
80
Multimodal
80
Physics
80
Frontend Development
80
Grounding
80
Healthcare
80
Vision
80
Image To Text
70
Math
70
Biology
70
Chemistry
70
Code
70
Economics
70
Tool Calling
70
Coding
50

Цены

Цена ввода$0.5 / 1M токенов
Цена вывода$3 / 1M токенов
Смешанная цена (3:1)$1.125 / 1M токенов
Цена чтения кэша$0.05 / 1M токенов
Цена записи кэша$0.625 / 1M токенов

Скорость

Нет данных о скорости

Рейтинг цен провайдеров

Рейтинг цен провайдеров

6 провайдеров

Самый дешевый: NanoGPTСамый дорогой: Alibaba (China)
ПровайдерВводВывод
1NanoGPTСамый дешевый
$0.4
$1.6
2OpenCode Go
$0.4
$1.6
3LLM Gateway
$0.4
$1.6
4Alibaba Cloud / Qwen TeamОсновной
$0.5
$3
5Alibaba
$0.5
$3
6Alibaba (China)
$0.5
$3

Сравнение цен разных API-провайдеров для этой модели.

Внешние ссылки