Claude 3.5 Sonnet

AnthropicClaudeProprietary

説明

Claude 3.5 Sonnet is a powerful AI model with industry-leading software engineering skills. It excels in coding, planning, and problem-solving, with significant improvements in agentic coding and tool use tasks. The model includes computer use capabilities in public beta, allowing it to interact with computer interfaces like a human user.

リリース日

2024-10-22

パラメータ

—

コンテキスト長

200K

モダリティ

image, pdf, text

能力レーダー

general

coding

reasoning

science推定

agents

100

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

OSWorld Extended

22.0%自己申告

OSWorld Screenshot-only

14.9%自己申告

Biology

GPQA

67.2%自己申告

Code

HumanEval

93.7%自己申告

SWE-Bench Verified

49.0%自己申告

Communication

TAU-bench Retail

69.2%自己申告

TAU-bench Airline

46.0%自己申告

Finance

MMLU

90.4%自己申告

MMLU-Pro

77.6%自己申告

General

MMMU

68.3%自己申告

Image To Text

DocVQA

95.2%自己申告

Language

BIG-Bench Hard

93.1%自己申告

Math

GSM8k

96.4%自己申告

MGSM

91.6%自己申告

DROP

87.1%自己申告

MATH

78.3%自己申告

MathVista

67.7%自己申告

Multimodal

AI2D

94.7%自己申告

ChartQA

90.8%自己申告

AA評価指数

AA評価データがありません

LLM Statsカテゴリスコア

Image To Text

100

Language

Math

Legal

Multimodal

Reasoning

Finance

General

Healthcare

Vision

Physics

Biology

Chemistry

Code

Communication

Tool Calling

Frontend Development

価格設定

入力価格$3 / 1Mトークン

出力価格$15 / 1Mトークン

混合価格（3:1）$6 / 1Mトークン

キャッシュ読み取り価格$0.3 / 1Mトークン

キャッシュ書き込み価格$3.75 / 1Mトークン

速度

速度データがありません

プロバイダー価格ランキング

2 プロバイダー

最安: Anthropic最高: LLM Gateway

プロバイダー入力出力

1Anthropicプライマリ

$15

2LLM Gateway

$15

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	121	18.0	LS
マルチモーダルランキング	1	94.0	LS