o3

OpenAIOpenAI o-seriesProprietary

설명

OpenAI's most powerful reasoning model. o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.

출시일

2025-04-16

파라미터

—

컨텍스트 길이

200K

모달리티

image, pdf, text

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	48	57.0	LS
코딩 랭킹	30	80.0	AA
종합 랭킹	64	72.0	AA
수학 추론	28	92.0	AA
멀티모달 랭킹	38	79.0	LS
추론	86	53.0	LS
과학	87	63.0	AA

벤치마크 점수 (LLM Stats)

Agents

Tau-bench

63.0%자체 보고

BrowseComp

49.7%자체 보고

Biology

GPQA

83.3%자체 보고

Code

Aider-Polyglot

81.3%자체 보고

SWE-Bench Verified

69.1%자체 보고

Communication

Tau2 Retail

80.2%자체 보고

Tau2 Airline

64.8%자체 보고

Multi-Challenge

60.4%자체 보고

Tau2 Telecom

58.2%자체 보고

General

MMMU

82.9%자체 보고

MMMU-Pro

76.4%자체 보고

Healthcare

VideoMMMU

83.3%자체 보고

Language

COLLIE

98.4%자체 보고

Math

AIME 2024

91.6%자체 보고

MathVista

86.8%자체 보고

AIME 2025

86.4%자체 보고

FrontierMath

15.8%자체 보고

Humanity's Last Exam

14.7%자체 보고

Multimodal

CharXiv-R

78.6%자체 보고

Reasoning

ARC-AGI

88.0%자체 보고

ERQA

64.0%자체 보고

ARC-AGI v2

6.5%자체 보고

AA 평가 지수

Math Index

88.3

Intelligence Index

30.4

Math 500

1.0

Aime

0.9

Aime 25

0.9

Mmlu Pro

0.9

Gpqa

0.8

Livecodebench

0.8

Tau2

0.8

Ifbench

0.7

Lcr

0.7

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.2

LLM Stats 카테고리 점수

Language

100

Writing

100

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Code

Reasoning

Frontend Development

Communication

Tool Calling

Math

Agents

Vision

Spatial Reasoning

가격

입력 가격$2 / 1M 토큰

출력 가격$8 / 1M 토큰

혼합 가격 (3:1)$3.5 / 1M 토큰

캐시 읽기 가격$0.5 / 1M 토큰

속도

토큰/초168.9

첫 토큰 지연6.19s

첫 응답 지연6.19s

공급자 가격 순위

16개 공급자

최저가: Poe최고가: Jiekou.AI

공급자입력출력

1Poe최저가

$1.8

$7.2

2OpenAI주요

3NanoGPT

4Abacus

5OpenRouter

6Kilo Gateway

7Cloudflare AI Gateway

8Helicone

9Azure Cognitive Services

10DigitalOcean

11Vercel AI Gateway

12LLM Gateway

13Azure

14NEAR AI Cloud

15Merge Gateway

16Jiekou.AI

$10

$40

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

LLM Stats Artificial Analysis