GPT-4o (Nov '24)
OpenAIGPT
Description
GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.
Release Date
2024-11-20
Parameters
—
Context Length
128K
Modalities
image, pdf, text
Capability Radar
29
general
31
coding
24
reasoning
37
scienceest.
50
agents
90
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 399 | 16.0 | AA |
| General Ranking | 317 | 34.0 | AA |
| Math Reasoning | 297 | 22.0 | AA |
| Science | 308 | 37.0 | AA |
Benchmark Scores (LLM Stats)
Biology
GPQA
53.6%SR
Code
HumanEval
90.2%SR
Finance
MMLU
88.7%SR
MMLU-Pro
72.6%SR
Math
MGSM
90.5%SR
DROP
83.4%SR
MATH
76.6%SR
MathVista
63.8%SR
AA Evaluation Indices
Intelligence Index11.2
Math Index6.0
Math 5000.8
Mmlu Pro0.7
Gpqa0.5
Ifbench0.3
Scicode0.3
Livecodebench0.3
Tau20.3
Aime0.1
Terminalbench Hard0.1
Aime 250.1
Hle0.0
Lcr0.0
LLM Stats Category Scores
Code90
Language80
Legal80
Math80
Reasoning80
Finance80
Healthcare80
General70
Multimodal60
Vision60
Physics50
Biology50
Chemistry50
Pricing
Input Price$2.5 / 1M tokens
Output Price$10 / 1M tokens
Blended Price (3:1)$4.375 / 1M tokens
Cache Read Price$1.25 / 1M tokens
Speed
Tokens/sec255.4
Time to First Token0.42s
Time to Answer0.42s
Provider Price Ranking
Provider Price Ranking
1 providers
ProviderInputOutput
1OpenAIPRIMARY
$2.5
$10
Compare pricing across different API providers for this model.