跳轉到主要內容

Claude Sonnet 5 (Adaptive Reasoning, Max Effort)

AnthropicClaude

描述

Claude Sonnet 5 is Anthropic's most agentic Sonnet-class model, an upgrade to Sonnet 4.6 that narrows the gap to Opus 4.8 on reasoning, tool use, coding, computer use, and knowledge work while staying lower priced. It plans, uses tools like browsers and terminals, and runs autonomously for long-horizon tasks. Capability gains include SWE-Bench Verified (85.2%), SWE-Bench Pro (63.2%), SWE-Bench Multilingual (78.3%), Terminal-Bench 2.1 (80.4%), OSWorld-Verified (81.2%), BrowseComp (84.7% single-agent, 86.6% multi-agent), Humanity's Last Exam with tools (57.4%), USAMO 2026 (79.5%), GDPval-AA v2 (1618 Elo), HealthBench Professional (57.8%), and FrontierCode v1 (38.8%). It supports adaptive thinking with selectable effort levels up to 'extra high' (xhigh) and a 1M-token context window with context compaction. The safety assessment found lower rates of misaligned behavior, hallucination, and sycophancy than Sonnet 4.6, with improved prompt-injection robustness; it ships with cyber safeguards enabled by default and uses an updated tokenizer (input maps to roughly 1.0-1.35x more tokens than Sonnet 4.6). Default model on Free and Pro plans and available to Max, Team, and Enterprise users, in Claude Code, and on the Claude Platform. Launches with introductory pricing of $2/$10 per million input/output tokens through August 31, 2026, then $3/$15. Available via the Claude API as `claude-sonnet-5`.

發布日期
2026-06-30
參數規模
上下文長度
1.0M
支援模態
image, pdf, text

能力雷達圖

50
general
69
coding
91
reasoning
68
science估算
70
agents
70
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
程式碼能力榜7
93.0
AA
通用能力榜5
89.0
AA
科學能力13
86.0
AA

基準測試分數 (LLM Stats)

Agents

GDPval-AA1618.00 / 3000自報
BrowseComp84.7%自報
OSWorld-Verified81.2%自報
Terminal-Bench 2.080.4%自報
SWE-Bench Pro63.2%自報
OfficeQA Pro59.4%自報
Toolathlon54.3%自報
FrontierCode38.8%自報
SWE-Bench Multimodal28.1%自報
AutomationBench13.5%自報
Legal Agent Benchmark5.8%自報

Code

SWE-Bench Verified85.2%自報
SWE-bench Multilingual78.3%自報
BenchCAD37.3%自報

General

GDP.pdf81.6%自報

Healthcare

HealthBench Professional57.8%自報

Math

USAMO 202633.39 / 42自報
ArXivMath72.2%自報
Humanity's Last Exam57.4%自報

Multimodal

CharXiv-R88.3%自報
ChartMuseum86.7%自報

AA 評測指數

Coding Index
71.5
Intelligence Index
53.4
Gpqa
0.9
Terminalbench V2 1
0.8
Lcr
0.7
Scicode
0.5
Hle
0.4
Tau Banking
0.3

LLM Stats 分類評分

Finance
100
Legal
100
General
100
Agents
100
Reasoning
100
Frontend Development
90
Search
80
Multimodal
70
Code
70
Tool Calling
70
Math
60
Healthcare
60
Vision
60

定價

輸入價格免費
輸出價格免費
混合價格(3:1)免費
快取讀取價格$0.2 / 1M tokens
快取寫入價格$2.5 / 1M tokens

速度

Tokens/秒0.0
首Token延遲0.00s
首回答延遲0.00s

供應商價格排行

供應商價格排行

9 個供應商

最便宜: Amazon Bedrock最貴: Cortecs
供應商輸入輸出
1Amazon Bedrock最便宜
$2
$10
2Vertex (Anthropic)
$2
$10
3Vertex
$2
$10
4Poe
$2.6
$13
5NanoGPT
$2.992
$14.994
6OpenRouter
$3
$15
7Kilo Gateway
$3
$15
8DigitalOcean
$3
$15
9Cortecs
$3.59
$17.92

比較該模型在不同 API 供應商之間的定價。

外部連結