Claude 3.7 Sonnet (Reasoning)

Anthropic · Claude

Description

Claude 3.7 Sonnet was, at release, the most intelligent Claude model and the first hybrid reasoning model on the market. It can produce either near-instant responses or extended, step-by-step thinking that is made visible to the user. It shows particularly strong improvements in coding and front-end web development.

Release date

2025-02-24

Parameters

Not listed

Context length

200K tokens

Modalities

image, PDF, text

Capability radar

Chart axes: general, coding, reasoning, science (estimated), agents, multimodal.

When no dedicated science benchmark is available, the Science score is estimated using a reasoning proxy.

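Benchmark scores (LLM Stats)

Category	Benchmark	Score	Source
Agents	Terminal-Bench	35.2%	Self-reported
Biology	GPQA	84.8%	Self-reported
Code	SWE-Bench Verified	70.3%	Self-reported
Communication	TAU-bench Retail	81.2%	Self-reported
Communication	TAU-bench Airline	58.4%	Self-reported
General	IFEval	93.2%	Self-reported
General	MMMLU	86.1%	Self-reported
General	MMMU	75.0%	Self-reported
Math	MATH-500	96.2%	Self-reported
Math	AIME 2024	80.0%	Self-reported
Math	AIME 2025	54.8%	Self-reported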

AA (Artificial Analysis) evaluation indices

Metric	Score
Math Index	56.3
Coding Index	36.4
Intelligence Index	27.1
MATH-500	0.9
MMLU-Pro	0.8
GPQA	0.8
LCR	0.6
AIME 2025	0.6
Tau2	0.5
AIME	0.5
IFBench	0.5
LiveCodeBench	0.5
SciCode	0.4
Terminal-Bench Hard	0.2
HLE	0.1

LLM Stats category scores

Categories charted: Instruction Following, Language, Structured Output, Math, Multimodal, Physics, General, Healthcare, Biology, Chemistry, Vision, Reasoning, Frontend Development, Communication, Tool Calling, Code, Agents.

Pricing

Input price: Free

Output price: Free

Blended price (3:1): Free

Cache read price: $0.30 / 1M tokens

Cache write price: $3.75 / 1M tokens
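The blended price (3:1) above is a weighted average of the input and output prices. Here is a minimal sketch of that arithmetic, assuming the ratio means three input tokens for every output token, which this page does not spell out. The function name `blended_price` and the example prices are illustrative and are not taken from this listing.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Return the weighted-average price per 1M tokens.

    Assumption: the "3:1" ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive number")
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Hypothetical per-1M-token prices, used only to show the calculation.
example = blended_price(input_price=3.00, output_price=15.00)
print(f"${example:.2f} / 1M tokens")
```

With a 3:1 weighting the input price dominates, so the blended figure sits much closer to the input price than to the output price.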

Speed

Tokens per second: 0.0

Time to first token: 0.00 s

Time to first answer: 0.00 s

Provider price ranking

3 providers

Lowest: Abacus. Highest: Anthropic.

Rank	Provider	Input/output price
1	Abacus (lowest)	$15
2	LLM Gateway	$15
3	Anthropic	$15

Compares this model's prices across different API providers.

External links

Artificial Analysis

Domain	Rank	Score	Source
Agent capability	111	35.0	LS
Coding	170	52.0	AA
Overall	148	57.0	AA
Mathematical reasoning	145	63.0	AA
Science	148	55.0	AA