o1-preview
OpenAI | OpenAI o-series | Proprietary
Description
A research preview model focused on mathematical and logical reasoning. It shows improved performance on tasks that require step-by-step reasoning, mathematical problem-solving, and code generation, with stronger formal reasoning alongside strong general capabilities.
Release date
2024-09-12
Parameter count
—
Context length
200K
Supported modalities
file, image, text
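Capability radar chart
general: 24
coding: 34
reasoning: 92
science (estimated): 60
agents: 0
multimodal: 80
The science score is estimated using reasoning ability as a proxy when no dedicated science benchmark is available.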
Leaderboard ranking
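Benchmark scores (LLM Stats)
Biology: GPQA, 73.3% (self-reported)
Code: SWE-Bench Verified, 41.3% (self-reported)
Factuality: SimpleQA, 42.4% (self-reported)
Finance: MMLU, 90.8% (self-reported)
General: LiveBench, 52.3% (self-reported)
Math: MGSM, 90.8% (self-reported)
Math: MATH, 85.5% (self-reported)
Math: AIME 2024, 42.0% (self-reported)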
AA evaluation indices
Coding Index: 34.0
Intelligence Index: 23.7
Math 500: 0.9
LLM Stats category scores
Finance: 90
Healthcare: 90
Language: 90
Legal: 90
Biology: 70
Chemistry: 70
Math: 70
Physics: 70
General: 60
Reasoning: 60
Code: 40
Factuality: 40
Frontend Development: 40
Pricing
Input price: $16.5 / 1M tokens
Output price: $66 / 1M tokens
Blended price (3:1): $28.875 / 1M tokens
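The blended figure is consistent with a weighted average of the input and output prices at three input tokens per output token. The sketch below reproduces that arithmetic and estimates the cost of a single request. The function names `blended_price` and `request_cost` are hypothetical helpers introduced for illustration, and the 3:1 weighting is an assumption inferred from the label, not a documented formula.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens for every 1 output token (3:1 by default).
    """
    return (ratio * input_price + output_price) / (ratio + 1.0)


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in dollars of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Prices as listed on this page (dollars per 1M tokens).
INPUT_PRICE = 16.5
OUTPUT_PRICE = 66.0

print(blended_price(INPUT_PRICE, OUTPUT_PRICE))  # 28.875
print(f"{request_cost(3000, 1000, INPUT_PRICE, OUTPUT_PRICE):.4f}")  # 0.1155
```

A request with 3,000 input tokens and 1,000 output tokens matches the 3:1 ratio, so its cost per token works out to the blended rate.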
Speed
Tokens/second: 0.0 tokens/s
Time to first token: 0.00s
Time to first answer: 0.00s
Available providers
(LS internal pricing unit) No provider data available