Skip to main content

o3-mini

OpenAIOpenAI o-seriesProprietary

Description

A smaller variant of O3, expected to offer enhanced multimodal capabilities, improved reasoning, and more efficient resource utilization compared to previous models while maintaining strong performance on core tasks.

Release Date
2025-01-31
Parameters
Context Length
200K
Modalities
text

Capability Radar

35
general
65
coding
83
reasoning
49
scienceest.
40
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking217
45.0
AA
General Ranking234
45.0
AA
Math Reasoning50
89.0
AA
Reasoning83
54.0
LS
Science168
52.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA77.2%SR

Code

Aider-Polyglot66.7%SR
Aider-Polyglot Edit60.4%SR
SWE-Bench Verified49.3%SR
SWE-Lancer18.0%SR
SWE-Lancer (IC-Diamond subset)7.4%SR

Communication

Multi-IF79.5%SR
TAU-bench Retail57.6%SR
Multi-Challenge39.9%SR
TAU-bench Airline32.4%SR

Factuality

SimpleQA15.0%SR

Finance

MMLU86.9%SR

General

IFEval93.9%SR
LiveBench84.6%SR
Multilingual MMLU80.7%SR
Internal API instruction following (hard)50.0%SR

Language

COLLIE98.7%SR

Long Context

OpenAI-MRCR: 2 needle 128k18.7%SR
ComplexFuncBench17.6%SR

Math

MATH97.9%SR
MGSM92.0%SR
AIME 202487.3%SR
FrontierMath9.2%SR

Reasoning

Graphwalks parents <128k58.3%SR
Graphwalks BFS <128k51.0%SR

AA Evaluation Indices

Intelligence Index
19.0
Math 500
1.0
Mmlu Pro
0.8
Aime
0.8
Gpqa
0.7
Livecodebench
0.7
Scicode
0.4
Tau2
0.3
Hle
0.1
Terminalbench Hard
0.1

LLM Stats Category Scores

Writing
100
Instruction Following
90
Language
90
Legal
90
Finance
90
Healthcare
90
Math
80
Physics
80
Biology
80
Chemistry
80
General
70
Reasoning
60
Structured Output
60
Spatial Reasoning
50
Frontend Development
50
Communication
50
Code
40
Tool Calling
40
Long Context
20
Factuality
10

Pricing

Input Price$1.1 / 1M tokens
Output Price$4.4 / 1M tokens
Blended Price (3:1)$1.925 / 1M tokens
Cache Read Price$0.55 / 1M tokens

Speed

Tokens/sec229.8
Time to First Token5.43s
Time to Answer5.43s

Provider Price Ranking

Provider Price Ranking

9 providers

Cheapest: NanoGPTMost Expensive: Azure
ProviderInputOutput
1NanoGPTCheapest
$1.088
$4.3996
2OpenAIPRIMARY
$1.1
$4.4
3Abacus
$1.1
$4.4
4Jiekou.AI
$1.1
$4.4
5Helicone
$1.1
$4.4
6Azure Cognitive Services
$1.1
$4.4
7DigitalOcean
$1.1
$4.4
8LLM Gateway
$1.1
$4.4
9Azure
$1.1
$4.4

Compare pricing across different API providers for this model.

External Sources