GPT-4.5 (Preview)

OpenAIGPTProprietary

설명

GPT-4.5 is OpenAI's most advanced model, offering improved reasoning, coding, and creative capabilities with faster performance and longer context handling than GPT-4. It features enhanced instruction following, reduced hallucinations, and better factual accuracy.

출시일

2025-02-27

파라미터

—

컨텍스트 길이

—

모달리티

image, text

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
종합 랭킹	444	21.0	AA
멀티모달 랭킹	52	75.0	LS
추론	43	73.0	LS

벤치마크 점수 (LLM Stats)

Biology

GPQA

69.5%자체 보고

Code

HumanEval

88.0%자체 보고

Aider-Polyglot Edit

44.9%자체 보고

SWE-Bench Verified

38.0%자체 보고

SWE-Lancer

37.3%자체 보고

SWE-Lancer (IC-Diamond subset)

17.4%자체 보고

Communication

Multi-IF

70.8%자체 보고

TAU-bench Retail

68.4%자체 보고

TAU-bench Airline

50.0%자체 보고

Multi-Challenge

43.8%자체 보고

Factuality

SimpleQA

62.5%자체 보고

Finance

MMLU

90.8%자체 보고

General

IFEval

88.2%자체 보고

MMMLU

85.1%자체 보고

MMMU

75.2%자체 보고

Internal API instruction following (hard)

54.0%자체 보고

Language

COLLIE

72.3%자체 보고

Long Context

ComplexFuncBench

63.0%자체 보고

OpenAI-MRCR: 2 needle 128k

38.5%자체 보고

Math

GSM8k

97.0%자체 보고

MathVista

72.3%자체 보고

AIME 2024

36.7%자체 보고

Multimodal

CharXiv-D

90.0%자체 보고

CharXiv-R

55.4%자체 보고

Reasoning

Graphwalks parents <128k

72.6%자체 보고

Graphwalks BFS <128k

72.3%자체 보고

AA 평가 지수

Intelligence Index

13.6

LLM Stats 카테고리 점수

Legal

Finance

Instruction Following

Language

Math

Healthcare

Multimodal

Physics

Spatial Reasoning

Structured Output

General

Biology

Chemistry

Vision

Writing

Reasoning

Factuality

Communication

Tool Calling

Long Context

Code

Frontend Development

가격

입력 가격무료

출력 가격무료

혼합 가격 (3:1)무료

속도

토큰/초0.0

첫 토큰 지연0.00s

첫 응답 지연0.00s

공급자 가격 순위

프로바이더 데이터가 없습니다

외부 링크

LLM Stats Artificial Analysis