Claude Sonnet 5 (Adaptive Reasoning, Max Effort)

AnthropicClaude

Описание

Claude Sonnet 5 is Anthropic's most agentic Sonnet-class model, an upgrade to Sonnet 4.6 that narrows the gap to Opus 4.8 on reasoning, tool use, coding, computer use, and knowledge work while staying lower priced. It plans, uses tools like browsers and terminals, and runs autonomously for long-horizon tasks. Capability gains include SWE-Bench Verified (85.2%), SWE-Bench Pro (63.2%), SWE-Bench Multilingual (78.3%), Terminal-Bench 2.1 (80.4%), OSWorld-Verified (81.2%), BrowseComp (84.7% single-agent, 86.6% multi-agent), Humanity's Last Exam with tools (57.4%), USAMO 2026 (79.5%), GDPval-AA v2 (1618 Elo), HealthBench Professional (57.8%), and FrontierCode v1 (38.8%). It supports adaptive thinking with selectable effort levels up to 'extra high' (xhigh) and a 1M-token context window with context compaction. The safety assessment found lower rates of misaligned behavior, hallucination, and sycophancy than Sonnet 4.6, with improved prompt-injection robustness; it ships with cyber safeguards enabled by default and uses an updated tokenizer (input maps to roughly 1.0-1.35x more tokens than Sonnet 4.6). Default model on Free and Pro plans and available to Max, Team, and Enterprise users, in Claude Code, and on the Claude Platform. Launches with introductory pricing of $2/$10 per million input/output tokens through August 31, 2026, then $3/$15. Available via the Claude API as `claude-sonnet-5`.

Дата выхода

2026-06-30

Параметры

—

Длина контекста

1.0M

Модальности

image, pdf, text

Радар способностей

general

coding

reasoning

scienceоцен.

agents

multimodal

Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.

Рейтинги

Домен	#Место	Оценка	Источник
Рейтинг кодинга	7	93.0	AA
Общий рейтинг	5	89.0	AA
Наука	13	86.0	AA

Оценки бенчмарков (LLM Stats)

Agents

GDPval-AA

1618.00 / 3000Сам.

BrowseComp

84.7%Сам.

OSWorld-Verified

81.2%Сам.

Terminal-Bench 2.0

80.4%Сам.

SWE-Bench Pro

63.2%Сам.

OfficeQA Pro

59.4%Сам.

Toolathlon

54.3%Сам.

FrontierCode

38.8%Сам.

SWE-Bench Multimodal

28.1%Сам.

AutomationBench

13.5%Сам.

Legal Agent Benchmark

5.8%Сам.

Code

SWE-Bench Verified

85.2%Сам.

SWE-bench Multilingual

78.3%Сам.

BenchCAD

37.3%Сам.

General

GDP.pdf

81.6%Сам.

Healthcare

HealthBench Professional

57.8%Сам.

Math

USAMO 2026

33.39 / 42Сам.

ArXivMath

72.2%Сам.

Humanity's Last Exam

57.4%Сам.

Multimodal

CharXiv-R

88.3%Сам.

ChartMuseum

86.7%Сам.

Индексы оценки AA

Coding Index

71.5

Intelligence Index

53.4

Gpqa

0.9

Terminalbench V2 1

0.8

Lcr

0.7

Scicode

0.5

Hle

0.4

Tau Banking

0.3

Оценки категорий LLM Stats

Finance

100

Legal

100

General

100

Agents

100

Reasoning

100

Frontend Development

Multimodal

Code

Tool Calling

Math

Healthcare

Vision

Цены

Цена вводаБесплатно

Цена выводаБесплатно

Смешанная цена (3:1)Бесплатно

Цена чтения кэша$0.2 / 1M токенов

Цена записи кэша$2.5 / 1M токенов

Скорость

Токенов/сек0.0

Задержка первого токена0.00s

Время до первого ответа0.00s

Рейтинг цен провайдеров

9 провайдеров

Самый дешевый: Amazon BedrockСамый дорогой: Cortecs

ПровайдерВводВывод

1Amazon BedrockСамый дешевый

$10

2Vertex (Anthropic)

$10

3Vertex

$10

4Poe

$2.6

$13

5NanoGPT

$2.992

$14.994

6OpenRouter

$15

7Kilo Gateway

$15

8DigitalOcean

$15

9Cortecs

$3.59

$17.92

Сравнение цен разных API-провайдеров для этой модели.

Внешние ссылки

Artificial Analysis