o3-mini (high)

OpenAI · OpenAI o-series

Description

A smaller variant of o3, expected to offer enhanced multimodal capabilities, improved reasoning, and more efficient resource use than previous models while maintaining strong performance on core tasks.

Release Date

2025-01-31

Parameters

—

Context Length

200K

Modalities

text

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	Rank	Score	Source
Code Ranking	185	50.0	AA
General Ranking	184	52.0	AA
Math Reasoning	20	95.0	AA
Science	135	56.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score
Biology	GPQA	77.2% (SR)
Code	Aider-Polyglot	66.7% (SR)
Code	Aider-Polyglot Edit	60.4% (SR)
Code	SWE-Bench Verified	49.3% (SR)
Code	SWE-Lancer	18.0% (SR)
Code	SWE-Lancer (IC-Diamond subset)	7.4% (SR)
Communication	Multi-IF	79.5% (SR)
Communication	TAU-bench Retail	57.6% (SR)
Communication	Multi-Challenge	39.9% (SR)
Communication	TAU-bench Airline	32.4% (SR)
Factuality	SimpleQA	15.0% (SR)
Finance	MMLU	86.9% (SR)
General	IFEval	93.9% (SR)
General	LiveBench	84.6% (SR)
General	Multilingual MMLU	80.7% (SR)
General	Internal API instruction following (hard)	50.0% (SR)
Language	COLLIE	98.7% (SR)
Long Context	OpenAI-MRCR: 2 needle 128k	18.7% (SR)
Long Context	ComplexFuncBench	17.6% (SR)
Math	MATH	97.9% (SR)
Math	MGSM	92.0% (SR)
Math	AIME 2024	87.3% (SR)
Math	FrontierMath	9.2% (SR)
Reasoning	Graphwalks parents <128k	58.3% (SR)
Reasoning	Graphwalks BFS <128k	51.0% (SR)

AA Evaluation Indices

Metric	Score
Coding Index	42.1
Intelligence Index	18.4
MATH-500	1.0
AIME	0.9
MMLU-Pro	0.8
GPQA	0.8
LiveCodeBench	0.7
IFBench	0.7
SciCode	0.4
LCR	0.4
Tau2	0.3
HLE	0.1
Terminal-Bench Hard	0.1

LLM Stats Category Scores

Writing: 100

Other categories listed without scores: Instruction Following, Language, Legal, Finance, Healthcare, Math, Physics, Biology, Chemistry, General, Reasoning, Structured Output, Spatial Reasoning, Frontend Development, Communication, Code, Tool Calling, Long Context, Factuality

Pricing

Input Price: $1.10 / 1M tokens

Output Price: $4.40 / 1M tokens

Blended Price (3:1): $1.925 / 1M tokens

Cache Read Price: $0.55 / 1M tokens
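
The blended figure is consistent with a weighted average of three parts input price to one part output price. The sketch below reproduces that arithmetic and extends it to a rough per-request cost estimate. The function names, and the assumption that cached input tokens are billed at the cache-read rate in place of the regular input rate, are illustrative and not taken from the listing.

```python
# Prices in USD per 1M tokens, copied from the Pricing list above.
INPUT_PRICE = 1.10
OUTPUT_PRICE = 4.40
CACHE_READ_PRICE = 0.55


def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens for `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1.0)


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0) -> float:
    """Rough USD cost of one request.

    Assumption: `cached_tokens` is the portion of `input_tokens` served from
    cache and billed at CACHE_READ_PRICE instead of INPUT_PRICE.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    total = (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * CACHE_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return total / 1_000_000


if __name__ == "__main__":
    print(f"Blended (3:1): ${blended_price(INPUT_PRICE, OUTPUT_PRICE):.3f} / 1M tokens")
    print(f"3,000 in / 1,000 out: ${request_cost(3_000, 1_000):.4f}")
```

With the listed prices, `blended_price` gives (3 × 1.10 + 4.40) / 4 = 1.925, matching the Blended Price entry.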

Speed

Tokens/sec: 235.1

Time to First Token: 20.86s

Time to Answer: 20.86s
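
As a back-of-the-envelope aid, the speed figures can be combined into an estimate of how long a complete response takes. The sketch below assumes a simple model in which total time is the time to first token plus the output length divided by the listed throughput. That model and the function name are assumptions for illustration, not something the listing states, and real latency varies with load and prompt.

```python
# Figures copied from the Speed list above.
TOKENS_PER_SEC = 235.1
TIME_TO_FIRST_TOKEN_S = 20.86


def estimated_response_time(
    output_tokens: int,
    ttft_s: float = TIME_TO_FIRST_TOKEN_S,
    tokens_per_sec: float = TOKENS_PER_SEC,
) -> float:
    """Estimate seconds until the last token arrives.

    Assumed model: wait `ttft_s` for the first token, then stream the
    remaining output at a constant `tokens_per_sec`.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_s + output_tokens / tokens_per_sec


if __name__ == "__main__":
    for n in (100, 1_000, 5_000):
        print(f"{n:>5} output tokens: about {estimated_response_time(n):.1f}s")
```

Under this model the 20.86s wait before the first token dominates short responses; streaming adds roughly 4.3s per 1,000 output tokens at 235.1 tokens/sec.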

Provider Price Ranking

9 providers

Cheapest: Poe

Most Expensive: Merge Gateway

Rank	Provider	Input	Output
1	Poe (Cheapest)	$0.99	
2	OpenAI (Primary)	$1.10	$4.40
3	NanoGPT	$1.10	$4.40
4	OpenRouter	$1.10	$4.40
5	Kilo Gateway	$1.10	$4.40
6	Cloudflare AI Gateway	$1.10	$4.40
7	Vercel AI Gateway	$1.10	$4.40
8	NEAR AI Cloud	$1.10	$4.40
9	Merge Gateway	$1.10	$4.40


External Sources

Artificial Analysis