Claude 3.7 Sonnet (Reasoning)

AnthropicClaude

描述

The most intelligent Claude model and the first hybrid reasoning model on the market. Claude 3.7 Sonnet can produce near-instant responses or extended, step-by-step thinking that is made visible to the user. Shows particularly strong improvements in coding and front-end web development.

發布日期

2025-02-24

參數規模

—

上下文長度

200K

支援模態

image, pdf, text

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
智慧體能力模型榜	111	35.0	LS
程式碼能力榜	170	52.0	AA
通用能力榜	148	57.0	AA
數學推理	145	63.0	AA
科學能力	148	55.0	AA

基準測試分數 (LLM Stats)

Agents

Terminal-Bench

35.2%自報

Biology

GPQA

84.8%自報

Code

SWE-Bench Verified

70.3%自報

Communication

TAU-bench Retail

81.2%自報

TAU-bench Airline

58.4%自報

General

IFEval

93.2%自報

MMMLU

86.1%自報

MMMU

75.0%自報

Math

MATH-500

96.2%自報

AIME 2024

80.0%自報

AIME 2025

54.8%自報

AA 評測指數

Math Index

56.3

Coding Index

36.4

Intelligence Index

27.1

Math 500

0.9

Mmlu Pro

0.8

Gpqa

0.8

Lcr

0.6

Aime 25

0.6

Tau2

0.5

Aime

0.5

Ifbench

0.5

Livecodebench

0.5

Scicode

0.4

Terminalbench Hard

0.2

Hle

0.1

LLM Stats 分類評分

Instruction Following

Language

Structured Output

Math

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Vision

Reasoning

Frontend Development

Communication

Tool Calling

Code

Agents

定價

輸入價格免費

輸出價格免費

混合價格(3:1)免費

快取讀取價格$0.3 / 1M tokens

快取寫入價格$3.75 / 1M tokens

速度

Tokens/秒0.0

首Token延遲0.00s

首回答延遲0.00s

供應商價格排行

3 個供應商

最便宜: Abacus最貴: Anthropic

供應商輸入輸出

1Abacus最便宜

$15

2LLM Gateway

$15

3Anthropic

$15

比較該模型在不同 API 供應商之間的定價。

外部連結

Artificial Analysis