o1-preview
OpenAI
OpenAI o-series
Proprietary
Description
A research preview model focused on mathematical and logical reasoning. It shows improved performance on tasks that require step-by-step reasoning, mathematical problem-solving, and code generation, while maintaining strong general capabilities.
Release Date
2024-09-12
Parameters
—
Context Length
200K
Modalities
file, image, text
Capability Radar
General: 24
Coding: 34
Reasoning: 92
Science (est.): 60
Agents: 0
Multimodal: 80
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 109 | 58.0 | AA |
| General Ranking | 325 | 34.0 | AA |
| Math Reasoning | 27 | 93.0 | AA |
Benchmark Scores (LLM Stats)
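| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 73.3% (SR) |
| Code | SWE-Bench Verified | 41.3% (SR) |
| Factuality | SimpleQA | 42.4% (SR) |
| Finance | MMLU | 90.8% (SR) |
| General | LiveBench | 52.3% (SR) |
| Math | MGSM | 90.8% (SR) |
| Math | MATH | 85.5% (SR) |
| Math | AIME 2024 | 42.0% (SR) |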
AA Evaluation Indices
Coding Index: 34.0
Intelligence Index: 23.7
Math 500: 0.9
LLM Stats Category Scores
Finance: 90
Healthcare: 90
Language: 90
Legal: 90
Biology: 70
Chemistry: 70
Math: 70
Physics: 70
General: 60
Reasoning: 60
Code: 40
Factuality: 40
Frontend Development: 40
Pricing
Input Price: $16.5 / 1M tokens
Output Price: $66 / 1M tokens
Blended Price (3:1): $28.875 / 1M tokens
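The blended figure is consistent with a weighted average of the input and output prices at three input tokens per output token. The sketch below reproduces it under that assumption. The helper names `blended_price` and `request_cost` are hypothetical and are not part of any provider API.

```python
# Prices per 1M tokens, taken from the Pricing section above.
INPUT_PRICE = 16.5
OUTPUT_PRICE = 66.0


def blended_price(input_price, output_price, input_weight=3.0, output_weight=1.0):
    """Weighted average price per 1M tokens.

    Assumes the "3:1" ratio means 3 input tokens for every 1 output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens, output_tokens,
                 input_price=INPUT_PRICE, output_price=OUTPUT_PRICE):
    """Estimated cost in dollars for one request, given token counts."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)
print(f"Blended price: ${blended:.3f} / 1M tokens")  # Blended price: $28.875 / 1M tokens
```

For a workload with a different input-to-output mix, `request_cost` gives a closer estimate than the blended figure. Reasoning models may also bill additional output tokens that are not shown in the response, so actual costs can be higher than this estimate.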
Speed
Tokens/sec: 0.0 tokens/s
Time to First Token: 0.00s
Time to Answer: 0.00s
Available Providers
(LS internal units)
No provider data available