GPT-4o (May '24)
OpenAIGPTProprietary
Description
GPT-4o ('o' for 'omni') is a multimodal AI model that accepts text, audio, image, and video inputs, and generates text, audio, and image outputs. It matches GPT-4 Turbo performance on text and code, with improvements in non-English languages, vision, and audio understanding.
Release Date
2024-05-13
Parameters
—
Context Length
128K
Modalities
file, image, text
Capability Radar
31
general
28
coding
40
reasoning
35
scienceest.
0
agents
85
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 214 | 39.0 | AA |
| General Ranking | 274 | 39.0 | AA |
| Math Reasoning | 204 | 45.0 | AA |
| Science | 300 | 36.0 | AA |
Benchmark Scores (LLM Stats)
Biology
GPQA
53.6%SR
Code
HumanEval
90.2%SR
Finance
MMLU
88.7%SR
MMLU-Pro
72.6%SR
Math
MGSM
90.5%SR
DROP
83.4%SR
MATH
76.6%SR
MathVista
63.8%SR
AA Evaluation Indices
Coding Index24.2
Intelligence Index14.5
Math 5000.8
Mmlu Pro0.7
Gpqa0.5
Livecodebench0.3
Scicode0.3
Aime0.1
Hle0.0
LLM Stats Category Scores
Code90
Finance80
Healthcare80
Language80
Legal80
Math80
Reasoning80
General70
Vision60
Multimodal60
Biology50
Chemistry50
Physics50
Pricing
Input Price$5 / 1M tokens
Output Price$15 / 1M tokens
Blended Price (3:1)$7.5 / 1M tokens
Speed
Tokens/sec102.9 tokens/s
Time to First Token0.67s
Time to Answer0.67s
Available Providers
(LS internal units)| Provider | Input Price | Output Price |
|---|---|---|
| OpenAI | 2.5M | 10.0M |
| Azure | 2.5M | 10.0M |