Skip to main content

GPT-4o (May '24)

OpenAIGPTProprietary

Description

GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.

Release Date
2024-05-13
Parameters
Context Length
128K
Modalities
file, image, text

Capability Radar

31
general
28
coding
40
reasoning
35
scienceest.
0
agents
85
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking214
39.0
AA
General Ranking274
39.0
AA
Math Reasoning204
45.0
AA
Science300
36.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA53.6%SR

Code

HumanEval90.2%SR

Finance

MMLU88.7%SR
MMLU-Pro72.6%SR

Math

MGSM90.5%SR
DROP83.4%SR
MATH76.6%SR
MathVista63.8%SR

AA Evaluation Indices

Coding Index
24.2
Intelligence Index
14.5
Math 500
0.8
Mmlu Pro
0.7
Gpqa
0.5
Livecodebench
0.3
Scicode
0.3
Aime
0.1
Hle
0.0

LLM Stats Category Scores

Code
90
Finance
80
Healthcare
80
Language
80
Legal
80
Math
80
Reasoning
80
General
70
Vision
60
Multimodal
60
Biology
50
Chemistry
50
Physics
50

Pricing

Input Price$5 / 1M tokens
Output Price$15 / 1M tokens
Blended Price (3:1)$7.5 / 1M tokens

Speed

Tokens/sec102.9 tokens/s
Time to First Token0.67s
Time to Answer0.67s

Available Providers

(LS internal units)
ProviderInput PriceOutput Price
OpenAI2.5M10.0M
Azure2.5M10.0M

External Sources