o3

OpenAIOpenAI o-seriesProprietary

説明

OpenAI's most powerful reasoning model. o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.

リリース日

2025-04-16

パラメータ

—

コンテキスト長

200K

モダリティ

image, pdf, text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

Tau-bench

63.0%自己申告

BrowseComp

49.7%自己申告

Biology

GPQA

83.3%自己申告

Code

Aider-Polyglot

81.3%自己申告

SWE-Bench Verified

69.1%自己申告

Communication

Tau2 Retail

80.2%自己申告

Tau2 Airline

64.8%自己申告

Multi-Challenge

60.4%自己申告

Tau2 Telecom

58.2%自己申告

General

MMMU

82.9%自己申告

MMMU-Pro

76.4%自己申告

Healthcare

VideoMMMU

83.3%自己申告

Language

COLLIE

98.4%自己申告

Math

AIME 2024

91.6%自己申告

MathVista

86.8%自己申告

AIME 2025

86.4%自己申告

FrontierMath

15.8%自己申告

Humanity's Last Exam

14.7%自己申告

Multimodal

CharXiv-R

78.6%自己申告

Reasoning

ARC-AGI

88.0%自己申告

ERQA

64.0%自己申告

ARC-AGI v2

6.5%自己申告

AA評価指数

Math Index

88.3

Intelligence Index

30.4

Math 500

1.0

Aime

0.9

Aime 25

0.9

Mmlu Pro

0.9

Gpqa

0.8

Livecodebench

0.8

Tau2

0.8

Ifbench

0.7

Lcr

0.7

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.2

LLM Statsカテゴリスコア

Language

100

Writing

100

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Code

Reasoning

Frontend Development

Communication

Tool Calling

Math

Agents

Vision

Spatial Reasoning

価格設定

入力価格$2 / 1Mトークン

出力価格$8 / 1Mトークン

混合価格（3:1）$3.5 / 1Mトークン

キャッシュ読み取り価格$0.5 / 1Mトークン

速度

トークン/秒168.9

初トークン遅延6.19s

初回答遅延6.19s

プロバイダー価格ランキング

16 プロバイダー

最安: Poe最高: Jiekou.AI

プロバイダー入力出力

1Poe最安

$1.8

$7.2

2OpenAIプライマリ

3NanoGPT

4Abacus

5OpenRouter

6Kilo Gateway

7Cloudflare AI Gateway

8Helicone

9Azure Cognitive Services

10DigitalOcean

11Vercel AI Gateway

12LLM Gateway

13Azure

14NEAR AI Cloud

15Merge Gateway

16Jiekou.AI

$10

$40

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	48	57.0	LS
コーディングランキング	30	80.0	AA
総合ランキング	64	72.0	AA
数学的推論	28	92.0	AA
マルチモーダルランキング	38	79.0	LS
推論	86	53.0	LS
科学	87	63.0	AA