
GPT-4o (Nov '24)

OpenAI / GPT

Description

GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.

Release Date: 2024-11-20
Parameters: (not listed)
Context Length: 128K
Modalities: image, pdf, text

Capability Radar

general: 29
coding: 31
reasoning: 24
science (est.): 37
agents: 50
multimodal: 90

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain            Rank   Score   Source
Code Ranking      #399   16.0    AA
General Ranking   #317   34.0    AA
Math Reasoning    #297   22.0    AA
Science           #308   37.0    AA

Benchmark Scores (LLM Stats)

Biology

GPQA: 53.6% (SR)

Code

HumanEval: 90.2% (SR)

Finance

MMLU: 88.7% (SR)
MMLU-Pro: 72.6% (SR)

Math

MGSM: 90.5% (SR)
DROP: 83.4% (SR)
MATH: 76.6% (SR)
MathVista: 63.8% (SR)

AA Evaluation Indices

Intelligence Index: 11.2
Math Index: 6.0
MATH-500: 0.8
MMLU-Pro: 0.7
GPQA: 0.5
IFBench: 0.3
SciCode: 0.3
LiveCodeBench: 0.3
Tau2: 0.3
AIME: 0.1
Terminal-Bench Hard: 0.1
AIME 25: 0.1
HLE: 0.0
LCR: 0.0

LLM Stats Category Scores

Code: 90
Language: 80
Legal: 80
Math: 80
Reasoning: 80
Finance: 80
Healthcare: 80
General: 70
Multimodal: 60
Vision: 60
Physics: 50
Biology: 50
Chemistry: 50

Pricing

Input Price: $2.50 / 1M tokens
Output Price: $10.00 / 1M tokens
Blended Price (3:1): $4.375 / 1M tokens
Cache Read Price: $1.25 / 1M tokens
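
The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token, which matches the listed $4.375. The function names and the example token counts are illustrative and do not come from any provider's API.

```python
# Prices in USD per 1M tokens, taken from the Pricing section above.
INPUT_PRICE = 2.50
OUTPUT_PRICE = 10.00
CACHE_READ_PRICE = 1.25


def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens, weighting input:output as ratio:1.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    return (ratio * input_price + output_price) / (ratio + 1)


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Estimated USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens billed at the
    cache read price; the remainder is billed at the full input price.
    """
    uncached = input_tokens - cached_tokens
    total = (
        uncached * INPUT_PRICE
        + cached_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / 1_000_000


if __name__ == "__main__":
    print(blended_price(INPUT_PRICE, OUTPUT_PRICE))  # 4.375
    # Hypothetical request: 10,000 input tokens (4,000 cached), 1,000 output tokens.
    print(round(request_cost(10_000, 1_000, cached_tokens=4_000), 6))
```

Workloads with a different input-to-output mix will see a different effective rate; passing another `ratio` shows how sensitive the blended number is to that mix.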

Speed

Tokens/sec: 255.4
Time to First Token: 0.42 s
Time to Answer: 0.42 s
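
These two figures allow a rough estimate of how long a full response takes. The sketch below assumes a constant generation rate after the first token, which real deployments will not match exactly. The function name is illustrative.

```python
TOKENS_PER_SEC = 255.4        # throughput listed above
TIME_TO_FIRST_TOKEN = 0.42    # seconds, listed above


def estimated_response_seconds(
    output_tokens: int,
    tokens_per_sec: float = TOKENS_PER_SEC,
    ttft: float = TIME_TO_FIRST_TOKEN,
) -> float:
    """Rough wall-clock estimate: time to first token plus streaming time.

    Assumption: throughput stays constant for the whole response.
    """
    return ttft + output_tokens / tokens_per_sec


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(n, round(estimated_response_seconds(n), 2))
```

Under this model, long responses are dominated by throughput and short ones by time to first token.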

Provider Price Ranking

1 provider

#   Provider           Input   Output
1   OpenAI (primary)   $2.50   $10.00

Compare pricing across different API providers for this model.

External Sources