o1

OpenAIOpenAI o-seriesProprietary

説明

A research preview model focused on mathematical and logical reasoning capabilities, demonstrating improved performance on tasks requiring step-by-step reasoning, mathematical problem-solving, and code generation. The model shows enhanced capabilities in formal reasoning while maintaining strong general capabilities.

リリース日

2024-12-05

パラメータ

—

コンテキスト長

200K

モダリティ

image, pdf, text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Biology

GPQA

78.0%自己申告

GPQA Biology

69.2%自己申告

Chemistry

GPQA Chemistry

64.7%自己申告

Code

HumanEval

88.1%自己申告

SWE-Bench Verified

41.0%自己申告

Communication

TAU-bench Retail

70.8%自己申告

TAU-bench Airline

50.0%自己申告

Factuality

SimpleQA

47.0%自己申告

Finance

MMLU

91.8%自己申告

General

MMMLU

87.7%自己申告

MMMU

77.6%自己申告

LiveBench

67.0%自己申告

Math

GSM8k

97.1%自己申告

MATH

96.4%自己申告

MGSM

89.3%自己申告

AIME 2024

74.3%自己申告

MathVista

71.8%自己申告

FrontierMath

5.5%自己申告

Physics

GPQA Physics

92.8%自己申告

AA評価指数

Coding Index

39.7

Intelligence Index

23.4

Math 500

1.0

Mmlu Pro

0.8

Gpqa

0.7

Aime

0.7

Ifbench

0.7

Livecodebench

0.7

Tau2

0.6

Lcr

0.6

Scicode

0.4

Terminalbench Hard

0.1

Hle

0.1

LLM Statsカテゴリスコア

Language

Legal

Finance

Math

Physics

Healthcare

Biology

Chemistry

Multimodal

Reasoning

General

Vision

Code

Communication

Tool Calling

Factuality

Frontend Development

価格設定

入力価格$15 / 1Mトークン

出力価格$60 / 1Mトークン

混合価格（3:1）$26.25 / 1Mトークン

キャッシュ読み取り価格$7.5 / 1Mトークン

速度

トークン/秒147.9

初トークン遅延13.04s

初回答遅延13.04s

プロバイダー価格ランキング

13 プロバイダー

最安: Poe最高: Merge Gateway

プロバイダー入力出力

1Poe最安

$14

$54

2NanoGPT

$14.994

$59.993

3OpenAIプライマリ

$15

$60

4OpenRouter

$15

$60

5Kilo Gateway

$15

$60

6Cloudflare AI Gateway

$15

$60

7Helicone

$15

$60

8Azure Cognitive Services

$15

$60

9DigitalOcean

$15

$60

10Vercel AI Gateway

$15

$60

11LLM Gateway

$15

$60

12Azure

$15

$60

13Merge Gateway

$15

$60

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
コーディングランキング	151	55.0	AA
総合ランキング	105	63.0	AA
数学的推論	55	87.0	AA
科学	195	49.0	AA