GPT-4.5 (Preview)

OpenAIGPTProprietary

説明

GPT-4.5 is OpenAI's most advanced model, offering improved reasoning, coding, and creative capabilities with faster performance and longer context handling than GPT-4. It features enhanced instruction following, reduced hallucinations, and better factual accuracy.

リリース日

2025-02-27

パラメータ

—

コンテキスト長

—

モダリティ

image, text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Biology

GPQA

69.5%自己申告

Code

HumanEval

88.0%自己申告

Aider-Polyglot Edit

44.9%自己申告

SWE-Bench Verified

38.0%自己申告

SWE-Lancer

37.3%自己申告

SWE-Lancer (IC-Diamond subset)

17.4%自己申告

Communication

Multi-IF

70.8%自己申告

TAU-bench Retail

68.4%自己申告

TAU-bench Airline

50.0%自己申告

Multi-Challenge

43.8%自己申告

Factuality

SimpleQA

62.5%自己申告

Finance

MMLU

90.8%自己申告

General

IFEval

88.2%自己申告

MMMLU

85.1%自己申告

MMMU

75.2%自己申告

Internal API instruction following (hard)

54.0%自己申告

Language

COLLIE

72.3%自己申告

Long Context

ComplexFuncBench

63.0%自己申告

OpenAI-MRCR: 2 needle 128k

38.5%自己申告

Math

GSM8k

97.0%自己申告

MathVista

72.3%自己申告

AIME 2024

36.7%自己申告

Multimodal

CharXiv-D

90.0%自己申告

CharXiv-R

55.4%自己申告

Reasoning

Graphwalks parents <128k

72.6%自己申告

Graphwalks BFS <128k

72.3%自己申告

AA評価指数

Intelligence Index

13.6

LLM Statsカテゴリスコア

Legal

Finance

Instruction Following

Language

Math

Healthcare

Multimodal

Physics

Spatial Reasoning

Structured Output

General

Biology

Chemistry

Vision

Writing

Reasoning

Factuality

Communication

Tool Calling

Long Context

Code

Frontend Development

価格設定

入力価格無料

出力価格無料

混合価格（3:1）無料

速度

トークン/秒0.0

初トークン遅延0.00s

初回答遅延0.00s

プロバイダー価格ランキング

プロバイダーデータがありません

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
総合ランキング	444	21.0	AA
マルチモーダルランキング	52	75.0	LS
推論	43	73.0	LS