GPT-5.4 Pro (xhigh)
OpenAIGPT
发布日期
2026-03-05
参数规模
—
上下文长度
1.1M
支持模态
image, text
能力雷达图
80
general
90
coding
70
reasoning
77
science估算
80
agents
90
multimodal
Science 在缺少专门科学评测时使用推理能力代理估算。
排行榜排名
暂无排名数据
基准测试分数 (LLM Stats)
Agents
BrowseComp
54.9%自报
Biology
GPQA
85.7%自报
Code
SWE-Lancer (IC-Diamond subset)
100.0%自报
HumanEval
93.4%自报
Aider-Polyglot
88.0%自报
SWE-Bench Verified
74.9%自报
Communication
Tau2 Telecom
96.7%自报
Tau2 Retail
81.1%自报
Multi-Challenge
69.6%自报
Tau2 Airline
62.6%自报
Finance
MMLU
92.5%自报
General
MMMU
84.2%自报
MMMU-Pro
78.4%自报
Internal API instruction following (hard)
64.0%自报
LongFact Objects
0.8%自报
LongFact Concepts
0.7%自报
Healthcare
VideoMMMU
84.6%自报
HealthBench Hard
1.6%自报
Language
COLLIE
99.0%自报
Long Context
OpenAI-MRCR: 2 needle 128k
95.2%自报
OpenAI-MRCR: 2 needle 256k
86.8%自报
Math
AIME 2025
94.6%自报
HMMT 2025
93.3%自报
MATH
84.7%自报
FrontierMath
26.3%自报
Humanity's Last Exam
24.8%自报
Multimodal
VideoMME w sub.
86.7%自报
CharXiv-R
81.1%自报
Reasoning
BrowseComp Long Context 128k
90.0%自报
BrowseComp Long Context 256k
88.8%自报
Graphwalks BFS <128k
78.3%自报
Graphwalks parents <128k
73.3%自报
ERQA
65.7%自报
FActScore
1.0%自报
AA 评测指数
暂无 AA 评测数据
LLM Stats 分类评分
Language100
Long Context100
Writing100
Legal90
Physics90
Finance90
Biology90
Chemistry90
Code90
Video90
Reasoning80
General80
Communication80
Tool Calling80
Math70
Multimodal70
Search70
Frontend Development70
Healthcare70
Vision70
Spatial Reasoning60
Structured Output60
Agents50
Robotics20
定价
输入价格$30 / 1M tokens
输出价格$180 / 1M tokens
混合价格(3:1)$67.5 / 1M tokens
速度
Tokens/秒0.0
首Token延迟0.00s
首回答延迟0.00s
供应商价格排行
供应商价格排行
18 个供应商
最便宜: Jiekou.AI最贵: ZenMux
供应商输入输出
1Jiekou.AI最便宜
$13.5
$108
2Poe
$14
$110
3OpenAI
$15
$120
4302.AI
$15
$120
5NanoGPT
$15
$120
6OpenRouter
$15
$120
7Kilo Gateway
$15
$120
8Helicone
$15
$120
9Requesty
$15
$120
10Vercel AI Gateway
$15
$120
11LLM Gateway
$15
$120
12Azure
$15
$120
13OrcaRouter
$15
$120
14OpenCode Zen
$30
$180
15Azure Cognitive Services
$30
$180
16DigitalOcean
$30
$180
17Venice AI
$37.5
$225
18ZenMux
$45
$225
比较该模型在不同 API 供应商之间的定价。