Claude 3.7 Sonnet (Reasoning)
AnthropicClaude
Description
The most intelligent Claude model and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. Shows particularly strong improvements in coding and front-end web development.
Release Date
2025-02-24
Parameters
—
Context Length
200K
Modalities
image, pdf, text
Capability Radar
42
general
41
coding
62
reasoning
51
scienceest.
70
agents
80
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Agentic Capability | 111 | 35.0 | LS |
| Code Ranking | 170 | 52.0 | AA |
| General Ranking | 148 | 57.0 | AA |
| Math Reasoning | 145 | 63.0 | AA |
| Science | 148 | 55.0 | AA |
Benchmark Scores (LLM Stats)
Agents
Terminal-Bench
35.2%SR
Biology
GPQA
84.8%SR
Code
SWE-Bench Verified
70.3%SR
Communication
TAU-bench Retail
81.2%SR
TAU-bench Airline
58.4%SR
General
IFEval
93.2%SR
MMMLU
86.1%SR
MMMU
75.0%SR
Math
MATH-500
96.2%SR
AIME 2024
80.0%SR
AIME 2025
54.8%SR
AA Evaluation Indices
Math Index56.3
Coding Index36.4
Intelligence Index27.1
Math 5000.9
Mmlu Pro0.8
Gpqa0.8
Lcr0.6
Aime 250.6
Tau20.5
Aime0.5
Ifbench0.5
Livecodebench0.5
Scicode0.4
Terminalbench Hard0.2
Hle0.1
LLM Stats Category Scores
Instruction Following90
Language90
Structured Output90
Math80
Multimodal80
Physics80
General80
Healthcare80
Biology80
Chemistry80
Vision80
Reasoning70
Frontend Development70
Communication70
Tool Calling70
Code50
Agents40
Pricing
Input PriceFree
Output PriceFree
Blended Price (3:1)Free
Cache Read Price$0.3 / 1M tokens
Cache Write Price$3.75 / 1M tokens
Speed
Tokens/sec0.0
Time to First Token0.00s
Time to Answer0.00s
Provider Price Ranking
Provider Price Ranking
3 providers
Cheapest: AbacusMost Expensive: Anthropic
ProviderInputOutput
1AbacusCheapest
$3
$15
2LLM Gateway
$3
$15
3Anthropic
$3
$15
Compare pricing across different API providers for this model.