GPT-4o (May '24)
OpenAI · GPT · Proprietary
Description
GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.
Release Date
2024-05-13
Parameters
—
Context Length
128K
Modalities
image, pdf, text
Capability Radar
| Capability | Score |
|---|---|
| general | 27 |
| coding | 28 |
| reasoning | 40 |
| science (est.) | 35 |
| agents | 36 |
| multimodal | 85 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 273 | 33.0 | AA |
| General Ranking | 301 | 36.0 | AA |
| Math Reasoning | 204 | 45.0 | AA |
| Science | 323 | 35.0 | AA |
Benchmark Scores (LLM Stats)
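| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 53.6% (SR) |
| Code | HumanEval | 90.2% (SR) |
| Finance | MMLU | 88.7% (SR) |
| Finance | MMLU-Pro | 72.6% (SR) |
| Math | MGSM | 90.5% (SR) |
| Math | DROP | 83.4% (SR) |
| Math | MATH | 76.6% (SR) |
| Math | MathVista | 63.8% (SR) |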
AA Evaluation Indices
| Index | Value |
|---|---|
| Coding Index | 24.2 |
| Intelligence Index | 8.6 |
| MATH-500 | 0.8 |
| MMLU-Pro | 0.7 |
| GPQA | 0.5 |
| LiveCodeBench | 0.3 |
| SciCode | 0.3 |
| AIME | 0.1 |
| HLE | 0.0 |
LLM Stats Category Scores
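| Category | Score |
|---|---|
| Code | 90 |
| Language | 80 |
| Legal | 80 |
| Math | 80 |
| Reasoning | 80 |
| Finance | 80 |
| Healthcare | 80 |
| General | 70 |
| Multimodal | 60 |
| Vision | 60 |
| Physics | 50 |
| Biology | 50 |
| Chemistry | 50 |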
Pricing
| Item | Price |
|---|---|
| Input Price | $5 / 1M tokens |
| Output Price | $15 / 1M tokens |
| Blended Price (3:1) | $7.5 / 1M tokens |
| Cache Read Price | $1.25 / 1M tokens |
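The blended figure is consistent with a weighted average of the input and output prices at three input tokens per output token. That reading of "3:1" is an assumption, since the page does not define it. A minimal sketch of the arithmetic follows; the function names `blended_price` and `request_cost` are illustrative and not part of any API.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens for every 1 output token (our reading of "3:1").
    """
    return (ratio * input_price + output_price) / (ratio + 1)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 5.0, output_price: float = 15.0) -> float:
    """Cost in USD of one request, with prices quoted per 1M tokens.

    Defaults are the list prices shown above; cache reads are ignored.
    """
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    print(blended_price(5.0, 15.0))   # 7.5
    print(request_cost(3000, 1000))   # cost of 3,000 input + 1,000 output tokens
```

With the listed prices, a request with 3,000 input tokens and 1,000 output tokens costs about $0.03 under this sketch.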
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 104.8 |
| Time to First Token | 0.51s |
| Time to Answer | 0.51s |
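These figures can be combined into a rough estimate of how long a full response takes. The sketch below assumes a simple model, which the page itself does not state: total time is the time to first token plus output length divided by a constant throughput. The function name `estimated_response_time` is hypothetical.

```python
def estimated_response_time(output_tokens: int,
                            tokens_per_sec: float = 104.8,
                            time_to_first_token: float = 0.51) -> float:
    """Rough wall-clock seconds to receive `output_tokens` tokens.

    Assumption: throughput stays constant after the first token arrives.
    Defaults are the measurements listed above.
    """
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return time_to_first_token + output_tokens / tokens_per_sec


if __name__ == "__main__":
    # A 500-token answer comes out at roughly 5.3 seconds under this model.
    print(round(estimated_response_time(500), 2))
```

Real latency varies with load, prompt length, and provider, so treat the result as an order-of-magnitude guide only.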
Provider Price Ranking
2 providers
Cheapest: OpenAI
Most Expensive: Azure
| Rank | Provider | Input | Output |
|---|---|---|---|
| 1 | OpenAI (Cheapest) | $0 | $0.00001 |
| 2 | Azure | $0 | $0.00001 |