o3

OpenAI | OpenAI o-series | Proprietary

Description

OpenAI's most powerful reasoning model. o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.

Release date

2025-04-16

Parameter count

—

Context length

200K

Supported modalities

image, pdf, text

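Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal.

The science score is estimated from reasoning ability as a proxy when no dedicated science evaluation is available.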

Leaderboard rankings

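Domain	Rank	Score	Source
Agent capability leaderboard	48	57.0	LS
Coding leaderboard	30	80.0	AA
General capability leaderboard	64	72.0	AA
Math reasoning	28	92.0	AA
Multimodal leaderboard	38	79.0	LS
Reasoning	86	53.0	LS
Science	87	63.0	AA

Sources: LS = LLM Stats, AA = Artificial Analysis.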

Benchmark scores (LLM Stats)

Category	Benchmark	Score (self-reported)
Agents	Tau-bench	63.0%
Agents	BrowseComp	49.7%
Biology	GPQA	83.3%
Code	Aider-Polyglot	81.3%
Code	SWE-Bench Verified	69.1%
Communication	Tau2 Retail	80.2%
Communication	Tau2 Airline	64.8%
Communication	Multi-Challenge	60.4%
Communication	Tau2 Telecom	58.2%
General	MMMU	82.9%
General	MMMU-Pro	76.4%
Healthcare	VideoMMMU	83.3%
Language	COLLIE	98.4%
Math	AIME 2024	91.6%
Math	MathVista	86.8%
Math	AIME 2025	86.4%
Math	FrontierMath	15.8%
Math	Humanity's Last Exam	14.7%
Multimodal	CharXiv-R	78.6%
Reasoning	ARC-AGI	88.0%
Reasoning	ERQA	64.0%
Reasoning	ARC-AGI v2	6.5%

Artificial Analysis (AA) evaluation indices

Metric	Value
Math Index	88.3
Intelligence Index	30.4
MATH-500	1.0
AIME	0.9
AIME 2025	0.9
MMLU-Pro	0.9
GPQA	0.8
LiveCodeBench	0.8
Tau2	0.8
IFBench	0.7
LCR	0.7
SciCode	0.4
Terminal-Bench Hard	0.4
HLE	0.2

LLM Stats category scores

Category	Score
Language	100
Writing	100

Categories listed without a score: Multimodal, Physics, General, Healthcare, Biology, Chemistry, Code, Reasoning, Frontend Development, Communication, Tool Calling, Math, Agents, Vision, Spatial Reasoning.

Pricing

Input price: $2 / 1M tokens

Output price: $8 / 1M tokens

Blended price (3:1): $3.5 / 1M tokens

Cached read price: $0.5 / 1M tokens
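
The 3:1 blended figure is consistent with a weighted average of three parts input price to one part output price. The following is a minimal sketch of that arithmetic, not the listing site's actual formula. The function names and the `request_cost` helper are hypothetical. The sketch also assumes that cached tokens are billed at the cached read price in place of the normal input price.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted-average price per 1M tokens for an input:output token ratio."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0,
                 input_price: float = 2.0, output_price: float = 8.0,
                 cached_price: float = 0.5) -> float:
    """Estimated USD cost of one request; prices are per 1M tokens.

    Assumption: cached_tokens is the portion of input_tokens billed at the
    cached read price instead of the regular input price.
    """
    fresh_tokens = input_tokens - cached_tokens
    total = (fresh_tokens * input_price
             + cached_tokens * cached_price
             + output_tokens * output_price)
    return total / 1_000_000


if __name__ == "__main__":
    print(blended_price(2.0, 8.0))      # 3:1 blend of the listed prices
    print(request_cost(10_000, 2_000))  # 10K input tokens, 2K output tokens
```

With the listed prices, `blended_price(2.0, 8.0)` reproduces the $3.5 figure, since (3 × 2 + 1 × 8) / 4 = 3.5.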

Speed

Tokens per second: 168.9

Time to first token: 6.19 s

Time to first answer: 6.19 s
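
As a rough illustration of how these two figures combine, the sketch below estimates end-to-end response time as first-token latency plus generation time at the listed throughput. This is a simplifying assumption: real latency varies with load, prompt size, and the amount of reasoning the model performs. The function name is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 168.9,
                               first_token_latency_s: float = 6.19) -> float:
    """Approximate wall-clock time to receive a full response.

    Assumes a constant generation rate after the first token arrives.
    """
    return first_token_latency_s + output_tokens / tokens_per_second


if __name__ == "__main__":
    for n in (100, 1_000, 5_000):
        print(f"{n} output tokens: ~{estimated_response_seconds(n):.1f} s")
```

Under these assumptions, a 1,000-token answer takes about 12 seconds, roughly half of which is the wait for the first token.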

Provider price ranking

16 providers

Cheapest: Poe. Most expensive: Jiekou.AI.

Rank	Provider	Input	Output	Note
1	Poe	$1.8	$7.2	Cheapest
2	OpenAI			Primary
3	NanoGPT
4	Abacus
5	OpenRouter
6	Kilo Gateway
7	Cloudflare AI Gateway
8	Helicone
9	Azure Cognitive Services
10	DigitalOcean
11	Vercel AI Gateway
12	LLM Gateway
13	Azure
14	NEAR AI Cloud
15	Merge Gateway
16	Jiekou.AI	$10	$40	Most expensive

Compare this model's pricing across API providers.

External links

LLM Stats, Artificial Analysis