Claude 3.5 Sonnet

AnthropicClaudeProprietary

描述

Claude 3.5 Sonnet is a powerful AI model with industry-leading software engineering skills. It excels in coding, planning, and problem-solving, with significant improvements in agentic coding and tool use tasks. The model includes computer use capabilities in public beta, allowing it to interact with computer interfaces like a human user.

發布日期

2024-10-22

參數規模

—

上下文長度

200K

支援模態

image, pdf, text

能力雷達圖

general

coding

reasoning

science估算

agents

100

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
智慧體能力模型榜	121	18.0	LS
多模態榜	1	94.0	LS

基準測試分數 (LLM Stats)

Agents

OSWorld Extended

22.0%自報

OSWorld Screenshot-only

14.9%自報

Biology

GPQA

67.2%自報

Code

HumanEval

93.7%自報

SWE-Bench Verified

49.0%自報

Communication

TAU-bench Retail

69.2%自報

TAU-bench Airline

46.0%自報

Finance

MMLU

90.4%自報

MMLU-Pro

77.6%自報

General

MMMU

68.3%自報

Image To Text

DocVQA

95.2%自報

Language

BIG-Bench Hard

93.1%自報

Math

GSM8k

96.4%自報

MGSM

91.6%自報

DROP

87.1%自報

MATH

78.3%自報

MathVista

67.7%自報

Multimodal

AI2D

94.7%自報

ChartQA

90.8%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Image To Text

100

Language

Math

Legal

Multimodal

Reasoning

Finance

General

Healthcare

Vision

Physics

Biology

Chemistry

Code

Communication

Tool Calling

Frontend Development

定價

輸入價格$3 / 1M tokens

輸出價格$15 / 1M tokens

混合價格(3:1)$6 / 1M tokens

快取讀取價格$0.3 / 1M tokens

快取寫入價格$3.75 / 1M tokens

速度

暫無速度資料

供應商價格排行

2 個供應商

最便宜: Anthropic最貴: LLM Gateway

供應商輸入輸出

1Anthropic主要

$15

2LLM Gateway

$15

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis