GPT-5.5 (high)
OpenAI, GPT
Description
GPT-5.5 is OpenAI's smartest model yet, designed for real work across agentic coding, computer use, knowledge work, and early scientific research. It matches GPT-5.4 per-token latency in real-world serving while reaching a much higher level of intelligence and using significantly fewer tokens to complete the same tasks. GPT-5.5 supports a 1M-token context window in the API and a 400K-token context window in Codex, with state-of-the-art results on Terminal-Bench 2.0, OSWorld-Verified, GDPval, FrontierMath, and CyberGym.
Release date: 2026-04-23
Parameter count: not listed
Context length: 1.1M tokens
Supported modalities: image, pdf, text
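Capability radar chart
general: 51
coding: 69
reasoning: 93
science (estimated): 70
agents: 80
multimodal: 85
The science score is estimated from reasoning ability as a proxy when no dedicated science evaluation is available.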
Leaderboard ranking
Benchmark scores (LLM Stats)
Agents
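GDPval-AA: 1135.00 / 3000 (self-reported)
BrowseComp: 84.4% (self-reported)
Terminal-Bench 2.0: 82.7% (self-reported)
CyberGym: 81.8% (self-reported)
BixBench: 80.5% (self-reported)
OSWorld-Verified: 78.7% (self-reported)
MCP Atlas: 75.3% (self-reported)
FrontierSWE: 73.0% (self-reported)
Finance Agent: 60.0% (self-reported)
SWE-Bench Pro: 58.6% (self-reported)
Toolathlon: 55.6% (self-reported)
OfficeQA Pro: 54.1% (self-reported)
Finance Agent v2: 51.8% (self-reported)
GeneBench: 25.0% (self-reported)
Legal Agent Benchmark: 2.1% (self-reported)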
Biology
GPQA: 93.6% (self-reported)
Communication
Tau2 Telecom: 98.0% (self-reported)
Finance
GDPval-MM: 84.9% (self-reported)
General
MMMU-Pro: 83.2% (self-reported)
LiveBench: 80.7% (self-reported)
MRCR v2 (8-needle): 74.0% (self-reported)
Long Context
Graphwalks parents >128k: 58.5% (self-reported)
Graphwalks BFS >128k: 45.4% (self-reported)
Math
Humanity's Last Exam: 52.2% (self-reported)
FrontierMath: 35.4% (self-reported)
Reasoning
ARC-AGI: 95.0% (self-reported)
ARC-AGI v2: 85.0% (self-reported)
AA evaluation indices
Coding Index: 71.6
Intelligence Index: 53.1
GPQA: 0.9
Tau2: 0.9
Terminalbench V2 10.8
LCR: 0.7
IFBench: 0.7
Terminalbench Hard: 0.6
SciCode: 0.6
HLE: 0.4
Tau Banking: 0.3
LLM Stats category scores
Legal: 100
Finance: 100
General: 100
Agents: 88
Reasoning: 52
Communication: 100
Physics: 90
Biology: 90
Chemistry: 90
Multimodal: 80
Safety: 80
Search: 80
Tool Calling: 80
Vision: 80
Spatial Reasoning: 70
Code: 70
Long Context: 60
Math: 60
Pricing
Input price: $5 / 1M tokens
Output price: $30 / 1M tokens
Blended price (3:1 input to output): $11.25 / 1M tokens
Cached read price: $0.5 / 1M tokens
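The blended figure is consistent with a token-weighted average of the input and output prices at a 3:1 input-to-output ratio. The sketch below reproduces that arithmetic and estimates the cost of a single request. The function names are hypothetical, and billing cached input tokens at the cached read price instead of the input price is an assumption for illustration, not a statement of how any provider actually bills.

```python
# Prices in USD per 1M tokens, taken from the pricing list above.
INPUT_PRICE = 5.0
OUTPUT_PRICE = 30.0
CACHED_READ_PRICE = 0.5


def blended_price(input_price, output_price, input_weight=3, output_weight=1):
    """Token-weighted average price per 1M tokens for a given input:output ratio."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens, output_tokens, cached_tokens=0):
    """Estimated USD cost of one request.

    Assumption: cached_tokens is the part of input_tokens served from cache,
    billed at the cached read price instead of the regular input price.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    fresh_tokens = input_tokens - cached_tokens
    usd = (
        fresh_tokens * INPUT_PRICE
        + cached_tokens * CACHED_READ_PRICE
        + output_tokens * OUTPUT_PRICE
    )
    return usd / 1_000_000


print(blended_price(INPUT_PRICE, OUTPUT_PRICE))  # 11.25
print(request_cost(input_tokens=3_000, output_tokens=1_000))
```

A workload with a different input-to-output mix can be priced by changing `input_weight` and `output_weight`; output-heavy workloads will land closer to the $30 output price than the 3:1 figure suggests.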
Speed
Tokens per second: 79.5
First-token latency: 16.43 s
First-answer latency: 16.43 s
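These two speed figures can be combined into a rough end-to-end estimate for a response. The sketch below assumes a simple linear model, where total time is the first-token latency plus the output length divided by throughput. That model, and the function name, are illustrative assumptions; real latency varies with load, prompt length, and reasoning effort.

```python
TOKENS_PER_SECOND = 79.5       # throughput listed above
FIRST_TOKEN_LATENCY_S = 16.43  # first-token latency listed above


def estimated_response_seconds(
    output_tokens,
    tokens_per_second=TOKENS_PER_SECOND,
    first_token_latency_s=FIRST_TOKEN_LATENCY_S,
):
    """Rough wall-clock estimate: wait for the first token, then stream the rest.

    Assumption: generation proceeds at a constant rate after the first token.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_latency_s + output_tokens / tokens_per_second


for n in (100, 1_000, 4_000):
    print(f"{n} output tokens: about {estimated_response_seconds(n):.1f} s")
```

Under this model the fixed first-token wait dominates short replies, while long replies are dominated by streaming time.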
Provider price ranking
1 provider
Rank, provider: input price, output price
1, OpenAI (primary): $5, $30
Compare this model's pricing across different API providers.