o3

OpenAI · OpenAI o-series · Proprietary

Description

OpenAI's most powerful reasoning model. o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.

Release date

2025-04-16

Parameters

—

Context length

200K

Modalities

image, pdf, text

Capability radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are not available.

Rankings

Domain	Rank	Score	Source
Agentic capability	48	57.0	LS
Coding	30	80.0	AA
General	64	72.0	AA
Mathematical reasoning	28	92.0	AA
Multimodal	38	79.0	LS
Reasoning	86	53.0	LS
Science	87	63.0	AA

Sources: LS = LLM Stats, AA = Artificial Analysis.

Benchmark scores (LLM Stats)

Category	Benchmark	Score
Agents	Tau-bench	63.0% (Aut.)
Agents	BrowseComp	49.7% (Aut.)
Biology	GPQA	83.3% (Aut.)
Code	Aider-Polyglot	81.3% (Aut.)
Code	SWE-Bench Verified	69.1% (Aut.)
Communication	Tau2 Retail	80.2% (Aut.)
Communication	Tau2 Airline	64.8% (Aut.)
Communication	Multi-Challenge	60.4% (Aut.)
Communication	Tau2 Telecom	58.2% (Aut.)
General	MMMU	82.9% (Aut.)
General	MMMU-Pro	76.4% (Aut.)
Healthcare	VideoMMMU	83.3% (Aut.)
Language	COLLIE	98.4% (Aut.)
Math	AIME 2024	91.6% (Aut.)
Math	MathVista	86.8% (Aut.)
Math	AIME 2025	86.4% (Aut.)
Math	FrontierMath	15.8% (Aut.)
Math	Humanity's Last Exam	14.7% (Aut.)
Multimodal	CharXiv-R	78.6% (Aut.)
Reasoning	ARC-AGI	88.0% (Aut.)
Reasoning	ERQA	64.0% (Aut.)
Reasoning	ARC-AGI v2	6.5% (Aut.)

AA evaluation indices

Metric	Value
Math Index	88.3
Intelligence Index	30.4
MATH-500	1.0
AIME	0.9
AIME 2025	0.9
MMLU-Pro	0.9
GPQA	0.8
LiveCodeBench	0.8
Tau2	0.8
IFBench	0.7
LCR	0.7
SciCode	0.4
Terminal-Bench Hard	0.4
HLE	0.2

LLM Stats category scores

Category	Score
Language	100
Writing	100

Other categories (scores not shown): Multimodal, Physics, General, Healthcare, Biology, Chemistry, Code, Reasoning, Frontend Development, Communication, Tool Calling, Math, Agents, Vision, Spatial Reasoning

Pricing

Input price: $2 / 1M tokens

Output price: $8 / 1M tokens

Blended price (3:1): $3.5 / 1M tokens

Cached read price: $0.5 / 1M tokens
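
The blended figure listed above is consistent with a weighted average of the input and output prices at three input tokens for every output token. A minimal sketch of that arithmetic, assuming the 3:1 ratio means input:output (the function name `blended_price` is illustrative and not part of any API):

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumes the ratio is input:output, i.e. 3 input tokens per output token.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


# Listed prices for o3: $2 input and $8 output per 1M tokens.
print(blended_price(2.0, 8.0))  # 3.5
```

Workloads with a different input-to-output mix will see a different effective price, so the ratio is worth adjusting to match actual traffic.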

Speed

Tokens/sec: 168.9

Time to first token: 6.19 s

Time to response: 6.19 s
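
As a rough way to read the speed figures, total response time can be approximated as the time to first token plus the output length divided by throughput. The sketch below assumes a constant generation rate after the first token, which is a simplification; `estimate_response_seconds` is a hypothetical helper, not part of any API, and its defaults are the values listed above.

```python
def estimate_response_seconds(output_tokens: int,
                              first_token_delay_s: float = 6.19,
                              tokens_per_second: float = 168.9) -> float:
    """Approximate wall-clock time to receive a full response.

    Assumption: tokens stream at a constant rate once the first token arrives.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return first_token_delay_s + output_tokens / tokens_per_second


# Example: a 1,000-token answer at the listed speed figures.
print(f"{estimate_response_seconds(1000):.2f} s")
```

Measured latency varies by provider, load, and how much reasoning the model does before answering, so this is only a back-of-the-envelope estimate.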

Provider price ranking

16 providers

Cheapest: Poe · Most expensive: Jiekou.AI

#	Provider	Input	Output
1	Poe (cheapest)	$1.8	$7.2
2	OpenAI (primary)
3	NanoGPT
4	Abacus
5	OpenRouter
6	Kilo Gateway
7	Cloudflare AI Gateway
8	Helicone
9	Azure Cognitive Services
10	DigitalOcean
11	Vercel AI Gateway
12	LLM Gateway
13	Azure
14	NEAR AI Cloud
15	Merge Gateway
16	Jiekou.AI	$10	$40

Compare prices across API providers for this model.

External sources

LLM Stats, Artificial Analysis