Claude 3.7 Sonnet (Reasoning)

AnthropicClaude

Description

The most intelligent Claude model and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. Shows particularly strong improvements in coding and front-end web development.

Release Date

2025-02-24

Parameters

—

Context Length

200K

Modalities

image, pdf, text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	111	35.0	LS
Code Ranking	170	52.0	AA
General Ranking	148	57.0	AA
Math Reasoning	145	63.0	AA
Science	148	55.0	AA

Benchmark Scores (LLM Stats)

Agents

Terminal-Bench

35.2%SR

Biology

GPQA

84.8%SR

Code

SWE-Bench Verified

70.3%SR

Communication

TAU-bench Retail

81.2%SR

TAU-bench Airline

58.4%SR

General

IFEval

93.2%SR

MMMLU

86.1%SR

MMMU

75.0%SR

Math

MATH-500

96.2%SR

AIME 2024

80.0%SR

AIME 2025

54.8%SR

AA Evaluation Indices

Math Index

56.3

Coding Index

36.4

Intelligence Index

27.1

Math 500

0.9

Mmlu Pro

0.8

Gpqa

0.8

Lcr

0.6

Aime 25

0.6

Tau2

0.5

Aime

0.5

Ifbench

0.5

Livecodebench

0.5

Scicode

0.4

Terminalbench Hard

0.2

Hle

0.1

LLM Stats Category Scores

Instruction Following

Language

Structured Output

Math

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Vision

Reasoning

Frontend Development

Communication

Tool Calling

Code

Agents

Pricing

Input PriceFree

Output PriceFree

Blended Price (3:1)Free

Cache Read Price$0.3 / 1M tokens

Cache Write Price$3.75 / 1M tokens

Speed

Tokens/sec0.0

Time to First Token0.00s

Time to Answer0.00s

Provider Price Ranking

3 providers

Cheapest: AbacusMost Expensive: Anthropic

ProviderInputOutput

1AbacusCheapest

$15

2LLM Gateway

$15

3Anthropic

$15

Compare pricing across different API providers for this model.

External Sources

Artificial Analysis