o3

OpenAIOpenAI o-seriesProprietary

Description

OpenAI's most powerful reasoning model. o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.

Date de sortie

2025-04-16

Paramètres

—

Longueur du contexte

200K

Modalités

image, pdf, text

Radar de capacités

general

coding

reasoning

scienceest.

agents

multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine	#Rang	Score	Source
Capacité agentique	48	57.0	LS
Classement codage	30	80.0	AA
Classement général	64	72.0	AA
Raisonnement mathématique	28	92.0	AA
Classement multimodal	38	79.0	LS
Raisonnement	86	53.0	LS
Science	87	63.0	AA

Scores de benchmarks (LLM Stats)

Agents

Tau-bench

63.0%Aut.

BrowseComp

49.7%Aut.

Biology

GPQA

83.3%Aut.

Code

Aider-Polyglot

81.3%Aut.

SWE-Bench Verified

69.1%Aut.

Communication

Tau2 Retail

80.2%Aut.

Tau2 Airline

64.8%Aut.

Multi-Challenge

60.4%Aut.

Tau2 Telecom

58.2%Aut.

General

MMMU

82.9%Aut.

MMMU-Pro

76.4%Aut.

Healthcare

VideoMMMU

83.3%Aut.

Language

COLLIE

98.4%Aut.

Math

AIME 2024

91.6%Aut.

MathVista

86.8%Aut.

AIME 2025

86.4%Aut.

FrontierMath

15.8%Aut.

Humanity's Last Exam

14.7%Aut.

Multimodal

CharXiv-R

78.6%Aut.

Reasoning

ARC-AGI

88.0%Aut.

ERQA

64.0%Aut.

ARC-AGI v2

6.5%Aut.

Indices d'évaluation AA

Math Index

88.3

Intelligence Index

30.4

Math 500

1.0

Aime

0.9

Aime 25

0.9

Mmlu Pro

0.9

Gpqa

0.8

Livecodebench

0.8

Tau2

0.8

Ifbench

0.7

Lcr

0.7

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.2

Scores par catégorie LLM Stats

Language

100

Writing

100

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Code

Reasoning

Frontend Development

Communication

Tool Calling

Math

Agents

Vision

Spatial Reasoning

Tarification

Prix d'entrée$2 / 1M tokens

Prix de sortie$8 / 1M tokens

Prix mixte (3:1)$3.5 / 1M tokens

Prix de lecture cache$0.5 / 1M tokens

Vitesse

Tokens/sec168.9

Délai du premier token6.19s

Temps de réponse6.19s

Classement des Prix par Fournisseur

16 fournisseurs

Moins cher: PoePlus cher: Jiekou.AI

FournisseurEntréeSortie

1PoeMoins cher

$1.8

$7.2

2OpenAIPRINCIPAL

3NanoGPT

4Abacus

5OpenRouter

6Kilo Gateway

7Cloudflare AI Gateway

8Helicone

9Azure Cognitive Services

10DigitalOcean

11Vercel AI Gateway

12LLM Gateway

13Azure

14NEAR AI Cloud

15Merge Gateway

16Jiekou.AI

$10

$40

Comparer les prix entre différents fournisseurs API pour ce modèle.

Sources externes

LLM Stats Artificial Analysis