Claude 3.5 Sonnet

Anthropic | Claude | Proprietary

Description

Claude 3.5 Sonnet is an AI model with industry-leading software engineering skills. It excels at coding, planning, and problem-solving, with significant improvements in agentic coding and tool use. The model also offers computer use in public beta, which lets it operate computer interfaces the way a human user would.

Release Date

2024-10-22

Parameters

—

Context Length

200K

Modalities

image, pdf, text

Capability Radar

[Radar chart. Axes: general, coding, reasoning, science (est.), agents, multimodal. Axis maximum: 100.]

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	121	18.0	LS
Multimodal Ranking	1	94.0	LS

Benchmark Scores (LLM Stats)

Category	Benchmark	Score
Agents	OSWorld Extended	22.0% SR
Agents	OSWorld Screenshot-only	14.9% SR
Biology	GPQA	67.2% SR
Code	HumanEval	93.7% SR
Code	SWE-Bench Verified	49.0% SR
Communication	TAU-bench Retail	69.2% SR
Communication	TAU-bench Airline	46.0% SR
Finance	MMLU	90.4% SR
Finance	MMLU-Pro	77.6% SR
General	MMMU	68.3% SR
Image To Text	DocVQA	95.2% SR
Language	BIG-Bench Hard	93.1% SR
Math	GSM8k	96.4% SR
Math	MGSM	91.6% SR
Math	DROP	87.1% SR
Math	MATH	78.3% SR
Math	MathVista	67.7% SR
Multimodal	AI2D	94.7% SR
Multimodal	ChartQA	90.8% SR

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

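[Chart. Categories: Image To Text, Language, Math, Legal, Multimodal, Reasoning, Finance, General, Healthcare, Vision, Physics, Biology, Chemistry, Code, Communication, Tool Calling, Frontend Development. The only value that survives is 100, shown next to Image To Text.]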

Pricing

Input Price: $3 / 1M tokens

Output Price: $15 / 1M tokens

Blended Price (3:1): $6 / 1M tokens

Cache Read Price: $0.30 / 1M tokens

Cache Write Price: $3.75 / 1M tokens
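
The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token, which does yield the listed $6. The `request_cost` helper is hypothetical and assumes that regular input, output, cache-read, and cache-write tokens are each billed at their own listed rate; check the provider's billing rules before relying on it.

```python
# Prices in USD per 1M tokens, copied from the pricing list above.
INPUT_PRICE = 3.00
OUTPUT_PRICE = 15.00
CACHE_READ_PRICE = 0.30
CACHE_WRITE_PRICE = 3.75


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: "3:1" is read as 3 input tokens per 1 output token.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens, output_tokens,
                 cache_read_tokens=0, cache_write_tokens=0):
    """Estimated USD cost of one request (hypothetical helper).

    Assumption: each token class is billed separately at its listed rate.
    """
    total_per_million = (
        input_tokens * INPUT_PRICE
        + output_tokens * OUTPUT_PRICE
        + cache_read_tokens * CACHE_READ_PRICE
        + cache_write_tokens * CACHE_WRITE_PRICE
    )
    return total_per_million / 1_000_000


if __name__ == "__main__":
    print(blended_price(INPUT_PRICE, OUTPUT_PRICE))  # 6.0
    print(request_cost(30_000, 10_000))
```

Under these assumptions, a prompt served from cache is billed at one tenth of the regular input rate ($0.30 versus $3 per 1M tokens), while writing to the cache costs 25% more than regular input ($3.75 versus $3).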

Speed

No speed data available

Provider Price Ranking

2 providers

Cheapest: Anthropic
Most Expensive: LLM Gateway

Rank	Provider	Input	Output
1	Anthropic (PRIMARY)	(not shown)	$15
2	LLM Gateway	(not shown)	$15


External Sources

LLM Stats, Artificial Analysis