o1

OpenAI | OpenAI o-series | Proprietary

Description

A research preview model focused on mathematical and logical reasoning. It shows improved performance on tasks that require step-by-step reasoning, mathematical problem-solving, and code generation, with stronger formal reasoning alongside strong general capabilities.

Release Date

2024-12-05

Parameters

—

Context Length

200K

Modalities

image, pdf, text

Capability Radar

[Radar chart axes: general, coding, reasoning, science (est.), agents, multimodal]

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	151	55.0	AA
General Ranking	105	63.0	AA
Math Reasoning	55	87.0	AA
Science	195	49.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score	Source
Biology	GPQA	78.0%	SR
Biology	GPQA Biology	69.2%	SR
Chemistry	GPQA Chemistry	64.7%	SR
Code	HumanEval	88.1%	SR
Code	SWE-Bench Verified	41.0%	SR
Communication	TAU-bench Retail	70.8%	SR
Communication	TAU-bench Airline	50.0%	SR
Factuality	SimpleQA	47.0%	SR
Finance	MMLU	91.8%	SR
General	MMMLU	87.7%	SR
General	MMMU	77.6%	SR
General	LiveBench	67.0%	SR
Math	GSM8k	97.1%	SR
Math	MATH	96.4%	SR
Math	MGSM	89.3%	SR
Math	AIME 2024	74.3%	SR
Math	MathVista	71.8%	SR
Math	FrontierMath	5.5%	SR
Physics	GPQA Physics	92.8%	SR

AA Evaluation Indices

Index	Score
Coding Index	39.7
Intelligence Index	23.4
MATH-500	1.0
MMLU-Pro	0.8
GPQA	0.7
AIME	0.7
IFBench	0.7
LiveCodeBench	0.7
Tau2	0.6
LCR	0.6
SciCode	0.4
Terminal-Bench Hard	0.1
HLE	0.1

LLM Stats Category Scores

[Chart: scores by category: Language, Legal, Finance, Math, Physics, Healthcare, Biology, Chemistry, Multimodal, Reasoning, General, Vision, Code, Communication, Tool Calling, Factuality, Frontend Development]

Pricing

Input Price: $15 / 1M tokens

Output Price: $60 / 1M tokens

Blended Price (3:1): $26.25 / 1M tokens

Cache Read Price: $7.50 / 1M tokens
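
The blended figure is consistent with a weighted average of three input tokens per output token: (3 × 15 + 1 × 60) / 4 = 26.25. A minimal Python sketch of that arithmetic follows, assuming the 3:1 ratio means input:output. The function names are illustrative and are not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens (assumes an input:output weighting)."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 15.0, output_price: float = 60.0) -> float:
    """Estimated cost in USD of one request, given list prices per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


print(blended_price(15.0, 60.0))   # 26.25
print(request_cost(3_000, 1_000))  # 0.105
```

The second helper applies the same list prices to a single request; it ignores cached-input discounts, so it is an upper-bound estimate when cache reads apply.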

Speed

Tokens/sec: 147.9

Time to First Token: 13.04s

Time to Answer: 13.04s
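
As a rough way to read these figures together, end-to-end time for a response can be approximated as time to first token plus output length divided by throughput. The sketch below assumes throughput stays constant after the first token, which real deployments may not satisfy; the function name and defaults are illustrative only.

```python
def estimated_response_time(output_tokens: int,
                            time_to_first_token_s: float = 13.04,
                            tokens_per_sec: float = 147.9) -> float:
    """Approximate seconds until a response of `output_tokens` tokens completes.

    Assumes a constant generation rate once the first token has arrived.
    """
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return time_to_first_token_s + output_tokens / tokens_per_sec


print(f"{estimated_response_time(500):.2f} s")  # 16.42 s
```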

Provider Price Ranking

13 providers

Cheapest: Poe

Most Expensive: Merge Gateway

Rank	Provider	Input ($ / 1M tokens)	Output ($ / 1M tokens)	Note
1	Poe	$14	$54	Cheapest
2	NanoGPT	$14.994	$59.993	
3	OpenAI	$15	$60	Primary
4	OpenRouter	$15	$60	
5	Kilo Gateway	$15	$60	
6	Cloudflare AI Gateway	$15	$60	
7	Helicone	$15	$60	
8	Azure Cognitive Services	$15	$60	
9	DigitalOcean	$15	$60	
10	Vercel AI Gateway	$15	$60	
11	LLM Gateway	$15	$60	
12	Azure	$15	$60	
13	Merge Gateway	$15	$60	


External Sources

LLM Stats, Artificial Analysis