メインコンテンツへスキップ

Claude Sonnet 5 (Adaptive Reasoning, Max Effort)

AnthropicClaude

説明

Claude Sonnet 5 is Anthropic's most agentic Sonnet-class model, an upgrade to Sonnet 4.6 that narrows the gap to Opus 4.8 on reasoning, tool use, coding, computer use, and knowledge work while staying lower priced. It plans, uses tools like browsers and terminals, and runs autonomously for long-horizon tasks. Capability gains include SWE-Bench Verified (85.2%), SWE-Bench Pro (63.2%), SWE-Bench Multilingual (78.3%), Terminal-Bench 2.1 (80.4%), OSWorld-Verified (81.2%), BrowseComp (84.7% single-agent, 86.6% multi-agent), Humanity's Last Exam with tools (57.4%), USAMO 2026 (79.5%), GDPval-AA v2 (1618 Elo), HealthBench Professional (57.8%), and FrontierCode v1 (38.8%). It supports adaptive thinking with selectable effort levels up to 'extra high' (xhigh) and a 1M-token context window with context compaction. The safety assessment found lower rates of misaligned behavior, hallucination, and sycophancy than Sonnet 4.6, with improved prompt-injection robustness; it ships with cyber safeguards enabled by default and uses an updated tokenizer (input maps to roughly 1.0-1.35x more tokens than Sonnet 4.6). Default model on Free and Pro plans and available to Max, Team, and Enterprise users, in Claude Code, and on the Claude Platform. Launches with introductory pricing of $2/$10 per million input/output tokens through August 31, 2026, then $3/$15. Available via the Claude API as `claude-sonnet-5`.

リリース日
2026-06-30
パラメータ
コンテキスト長
1.0M
モダリティ
image, pdf, text

能力レーダー

50
general
69
coding
91
reasoning
68
science推定
70
agents
70
multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ランキング

ドメイン#順位スコアソース
コーディングランキング7
93.0
AA
総合ランキング5
89.0
AA
科学13
86.0
AA

ベンチマークスコア (LLM Stats)

Agents

GDPval-AA1618.00 / 3000自己申告
BrowseComp84.7%自己申告
OSWorld-Verified81.2%自己申告
Terminal-Bench 2.080.4%自己申告
SWE-Bench Pro63.2%自己申告
OfficeQA Pro59.4%自己申告
Toolathlon54.3%自己申告
FrontierCode38.8%自己申告
SWE-Bench Multimodal28.1%自己申告
AutomationBench13.5%自己申告
Legal Agent Benchmark5.8%自己申告

Code

SWE-Bench Verified85.2%自己申告
SWE-bench Multilingual78.3%自己申告
BenchCAD37.3%自己申告

General

GDP.pdf81.6%自己申告

Healthcare

HealthBench Professional57.8%自己申告

Math

USAMO 202633.39 / 42自己申告
ArXivMath72.2%自己申告
Humanity's Last Exam57.4%自己申告

Multimodal

CharXiv-R88.3%自己申告
ChartMuseum86.7%自己申告

AA評価指数

Coding Index
71.5
Intelligence Index
53.4
Gpqa
0.9
Terminalbench V2 1
0.8
Lcr
0.7
Scicode
0.5
Hle
0.4
Tau Banking
0.3

LLM Statsカテゴリスコア

Finance
100
Legal
100
General
100
Agents
100
Reasoning
100
Frontend Development
90
Search
80
Multimodal
70
Code
70
Tool Calling
70
Math
60
Healthcare
60
Vision
60

価格設定

入力価格無料
出力価格無料
混合価格(3:1)無料
キャッシュ読み取り価格$0.2 / 1Mトークン
キャッシュ書き込み価格$2.5 / 1Mトークン

速度

トークン/秒0.0
初トークン遅延0.00s
初回答遅延0.00s

プロバイダー価格ランキング

プロバイダー価格ランキング

9 プロバイダー

最安: Amazon Bedrock最高: Cortecs
プロバイダー入力出力
1Amazon Bedrock最安
$2
$10
2Vertex (Anthropic)
$2
$10
3Vertex
$2
$10
4Poe
$2.6
$13
5NanoGPT
$2.992
$14.994
6OpenRouter
$3
$15
7Kilo Gateway
$3
$15
8DigitalOcean
$3
$15
9Cortecs
$3.59
$17.92

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク