메인 콘텐츠로 건너뛰기

Claude Sonnet 5 (Adaptive Reasoning, Max Effort)

AnthropicClaude

설명

Claude Sonnet 5 is Anthropic's most agentic Sonnet-class model, an upgrade to Sonnet 4.6 that narrows the gap to Opus 4.8 on reasoning, tool use, coding, computer use, and knowledge work while staying lower priced. It plans, uses tools like browsers and terminals, and runs autonomously for long-horizon tasks. Capability gains include SWE-Bench Verified (85.2%), SWE-Bench Pro (63.2%), SWE-Bench Multilingual (78.3%), Terminal-Bench 2.1 (80.4%), OSWorld-Verified (81.2%), BrowseComp (84.7% single-agent, 86.6% multi-agent), Humanity's Last Exam with tools (57.4%), USAMO 2026 (79.5%), GDPval-AA v2 (1618 Elo), HealthBench Professional (57.8%), and FrontierCode v1 (38.8%). It supports adaptive thinking with selectable effort levels up to 'extra high' (xhigh) and a 1M-token context window with context compaction. The safety assessment found lower rates of misaligned behavior, hallucination, and sycophancy than Sonnet 4.6, with improved prompt-injection robustness; it ships with cyber safeguards enabled by default and uses an updated tokenizer (input maps to roughly 1.0-1.35x more tokens than Sonnet 4.6). Default model on Free and Pro plans and available to Max, Team, and Enterprise users, in Claude Code, and on the Claude Platform. Launches with introductory pricing of $2/$10 per million input/output tokens through August 31, 2026, then $3/$15. Available via the Claude API as `claude-sonnet-5`.

출시일
2026-06-30
파라미터
컨텍스트 길이
1.0M
모달리티
image, pdf, text

능력 레이더

50
general
69
coding
91
reasoning
68
science추정
70
agents
70
multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인#순위점수소스
코딩 랭킹7
93.0
AA
종합 랭킹5
89.0
AA
과학13
86.0
AA

벤치마크 점수 (LLM Stats)

Agents

GDPval-AA1618.00 / 3000자체 보고
BrowseComp84.7%자체 보고
OSWorld-Verified81.2%자체 보고
Terminal-Bench 2.080.4%자체 보고
SWE-Bench Pro63.2%자체 보고
OfficeQA Pro59.4%자체 보고
Toolathlon54.3%자체 보고
FrontierCode38.8%자체 보고
SWE-Bench Multimodal28.1%자체 보고
AutomationBench13.5%자체 보고
Legal Agent Benchmark5.8%자체 보고

Code

SWE-Bench Verified85.2%자체 보고
SWE-bench Multilingual78.3%자체 보고
BenchCAD37.3%자체 보고

General

GDP.pdf81.6%자체 보고

Healthcare

HealthBench Professional57.8%자체 보고

Math

USAMO 202633.39 / 42자체 보고
ArXivMath72.2%자체 보고
Humanity's Last Exam57.4%자체 보고

Multimodal

CharXiv-R88.3%자체 보고
ChartMuseum86.7%자체 보고

AA 평가 지수

Coding Index
71.5
Intelligence Index
53.4
Gpqa
0.9
Terminalbench V2 1
0.8
Lcr
0.7
Scicode
0.5
Hle
0.4
Tau Banking
0.3

LLM Stats 카테고리 점수

Finance
100
Legal
100
General
100
Agents
100
Reasoning
100
Frontend Development
90
Search
80
Multimodal
70
Code
70
Tool Calling
70
Math
60
Healthcare
60
Vision
60

가격

입력 가격무료
출력 가격무료
혼합 가격 (3:1)무료
캐시 읽기 가격$0.2 / 1M 토큰
캐시 쓰기 가격$2.5 / 1M 토큰

속도

토큰/초0.0
첫 토큰 지연0.00s
첫 응답 지연0.00s

공급자 가격 순위

공급자 가격 순위

9개 공급자

최저가: Amazon Bedrock최고가: Cortecs
공급자입력출력
1Amazon Bedrock최저가
$2
$10
2Vertex (Anthropic)
$2
$10
3Vertex
$2
$10
4Poe
$2.6
$13
5NanoGPT
$2.992
$14.994
6OpenRouter
$3
$15
7Kilo Gateway
$3
$15
8DigitalOcean
$3
$15
9Cortecs
$3.59
$17.92

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크