Claude Opus 4.6 (Non-reasoning, High Effort)
AnthropicClaudeProprietary
Описание
Claude Opus 4.6 is Anthropic's most intelligent model, improving on its predecessor's coding skills with more careful planning, longer agentic task sustenance, more reliable operation in larger codebases, and better code review and debugging skills. First Opus-class model with 1M token context window (beta), 128K output tokens, and adaptive thinking. Features effort controls (low/medium/high/max) and context compaction for long-running tasks. State-of-the-art on Terminal-Bench 2.0, Humanity's Last Exam, GDPval-AA, and BrowseComp. Pricing: $5/$25 per million tokens (input/output).
Дата выхода
2026-02-05
Параметры
—
Длина контекста
1.0M
Модальности
image, text
Радар способностей
41
general
47
coding
84
reasoning
58
scienceоцен.
80
agents
80
multimodal
Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.
Рейтинги
| Домен | #Место | Оценка | Источник |
|---|---|---|---|
| Agents & Tools | 17 | 68.0 | LS |
| Code Ranking | 26 | 80.0 | AA |
| General Ranking | 86 | 71.0 | AA |
| Multimodal Ranking | 37 | 77.0 | LS |
| Reasoning | 46 | 69.0 | LS |
| Science | 58 | 69.0 | AA |
Оценки бенчмарков (LLM Stats)
Agents
Vending-Bench 2
801759.0%Сам.
GDPval-AA
1606.00 / 3000Сам.
DeepSearchQA
91.3%Сам.
BrowseComp
84.0%Сам.
CyberGym
73.8%Сам.
OSWorld
72.7%Сам.
Terminal-Bench 2.0
65.4%Сам.
MCP Atlas
62.7%Сам.
Finance Agent
60.7%Сам.
OpenRCA
34.9%Сам.
Biology
GPQA
91.3%Сам.
Code
SWE-Bench Verified
80.8%Сам.
SWE-bench Multilingual
77.8%Сам.
Communication
Tau2 Telecom
99.3%Сам.
Tau2 Retail
91.9%Сам.
General
MRCR v2 (8-needle)
93.0%Сам.
MMMLU
91.1%Сам.
MMMU-Pro
77.3%Сам.
Healthcare
FigQA
78.3%Сам.
Long Context
Graphwalks parents >128k
95.4%Сам.
Graphwalks BFS >128k
61.5%Сам.
Math
AIME 2025
99.8%Сам.
Humanity's Last Exam
53.1%Сам.
Multimodal
CharXiv-R
77.4%Сам.
Reasoning
ARC-AGI v2
68.8%Сам.
Индексы оценки AA
Coding Index47.6
Intelligence Index46.5
Tau20.8
Gpqa0.8
Lcr0.6
Terminalbench Hard0.5
Scicode0.5
Ifbench0.4
Hle0.2
Оценки категорий LLM Stats
Legal100
Agents100
Finance100
Reasoning100
General100
Communication100
Biology90
Chemistry90
Language90
Physics90
Search90
Spatial Reasoning80
Tool Calling80
Frontend Development80
Healthcare80
Long Context80
Math80
Multimodal80
Safety80
Vision70
Code70
Цены
Цена ввода$6.25 / 1M tokens
Цена вывода$25 / 1M tokens
Смешанная цена (3:1)$10.938 / 1M tokens
Скорость
Токенов/сек49.0 tokens/s
Задержка первого токена1.44s
Время до первого ответа1.44s
Доступные провайдеры
(Внутренние единицы LS)| Провайдер | Цена ввода | Цена вывода |
|---|---|---|
| Anthropic | 5.0M | 25.0M |