Claude 3.5 Sonnet

Anthropic, Claude, Proprietary

Description

Claude 3.5 Sonnet is a powerful AI model with industry-leading software engineering skills. It excels in coding, planning, and problem-solving, with significant improvements in agentic coding and tool use tasks. The model includes computer use capabilities in public beta, allowing it to interact with computer interfaces like a human user.

Release date

2024-10-22

Parameters

Not available

Context length

200K tokens

Modalities

image, pdf, text

Capability radar

Chart axes (scale label: 100): general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are not available.

Rankings

Domain	Rank	Score	Source
Agentic capability	121	18.0	LS
Multimodal ranking	1	94.0	LS

Benchmark scores (LLM Stats)

Category	Benchmark	Score
Agents	OSWorld Extended	22.0% (Aut.)
Agents	OSWorld Screenshot-only	14.9% (Aut.)
Biology	GPQA	67.2% (Aut.)
Code	HumanEval	93.7% (Aut.)
Code	SWE-Bench Verified	49.0% (Aut.)
Communication	TAU-bench Retail	69.2% (Aut.)
Communication	TAU-bench Airline	46.0% (Aut.)
Finance	MMLU	90.4% (Aut.)
Finance	MMLU-Pro	77.6% (Aut.)
General	MMMU	68.3% (Aut.)
Image To Text	DocVQA	95.2% (Aut.)
Language	BIG-Bench Hard	93.1% (Aut.)
Math	GSM8k	96.4% (Aut.)
Math	MGSM	91.6% (Aut.)
Math	DROP	87.1% (Aut.)
Math	MATH	78.3% (Aut.)
Math	MathVista	67.7% (Aut.)
Multimodal	AI2D	94.7% (Aut.)
Multimodal	ChartQA	90.8% (Aut.)

AA evaluation indices

No AA evaluation data available

LLM Stats category scores

Chart categories (scale label: 100): Image To Text, Language, Math, Legal, Multimodal, Reasoning, Finance, General, Healthcare, Vision, Physics, Biology, Chemistry, Code, Communication, Tool Calling, Frontend Development

Pricing

Input price: $3 / 1M tokens

Output price: $15 / 1M tokens

Blended price (3:1): $6 / 1M tokens

Cache read price: $0.3 / 1M tokens

Cache write price: $3.75 / 1M tokens
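The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token, which is the weighting that yields the listed $6; the page itself does not define the ratio. The function names `blended_price` and `request_cost` are hypothetical helpers for illustration, not part of any provider SDK.

```python
# Listed prices, in USD per 1M tokens.
INPUT_PRICE = 3.00
OUTPUT_PRICE = 15.00


def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request, ignoring cache reads and writes."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


if __name__ == "__main__":
    # (3 * 3 + 1 * 15) / 4 = 6.0, matching the listed blended price.
    print(blended_price(INPUT_PRICE, OUTPUT_PRICE))
    # 30,000 input tokens and 10,000 output tokens at the listed prices.
    print(request_cost(30_000, 10_000))
```

Under this reading, a workload with a different input-to-output mix will see a different effective price, so the blended number is best treated as a rough comparison aid rather than a billing rate.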

Speed

No speed data available

Price ranking by provider

2 providers

Cheapest: Anthropic. Most expensive: LLM Gateway.

Provider, input, output:

1. Anthropic (primary): $15

2. LLM Gateway: $15

Compare prices across different API providers for this model.

External sources

LLM Stats, Artificial Analysis