GPT-5.1 (high)

OpenAIGPTProprietary

描述

The best model for coding and agentic tasks with configurable reasoning effort. GPT-5.1 is our flagship model for coding and agentic tasks with configurable reasoning and non-reasoning effort.

发布日期

2025-11-13

参数规模

—

上下文长度

400K

支持模态

image, text

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
代码能力榜	12	89.0	AA
通用能力榜	29	79.0	AA
数学推理	17	95.0	AA
推理能力	8	90.0	LS
科学能力	48	71.0	AA

基准测试分数 (LLM Stats)

Biology

GPQA

88.1%自报

Code

SWE-Bench Verified

76.3%自报

Communication

Tau2 Telecom

95.6%自报

Tau2 Retail

77.9%自报

Tau2 Airline

67.0%自报

General

MMMU

85.4%自报

Math

AIME 2025

94.0%自报

FrontierMath

26.7%自报

Reasoning

BrowseComp Long Context 128k

90.0%自报

AA 评测指数

Math Index

94.0

Intelligence Index

38.9

Aime 25

0.9

Gpqa

0.9

Mmlu Pro

0.9

Livecodebench

0.9

Tau2

0.8

Lcr

0.8

Ifbench

0.7

Terminalbench Hard

0.5

Scicode

0.4

Hle

0.3

LLM Stats 分类评分

Multimodal

Physics

General

Healthcare

Biology

Chemistry

Vision

Reasoning

Frontend Development

Code

Communication

Tool Calling

Math

定价

输入价格$1.25 / 1M tokens

输出价格$10 / 1M tokens

混合价格(3:1)$3.438 / 1M tokens

缓存读取价格$0.125 / 1M tokens

速度

Tokens/秒112.1

首Token延迟42.37s

首回答延迟42.37s

供应商价格排行

13 个供应商

最便宜: OpenAI最贵: Neon

供应商输入输出

1OpenAI最便宜

$0.00001

2Poe

$1.1

3NanoGPT

$1.25

$10

4Perplexity Agent

$1.25

$10

5OpenRouter

$1.25

$10

6ZenMux

$1.25

$10

7Kilo Gateway

$1.25

$10

8Cloudflare AI Gateway

$1.25

$10

9Requesty

$1.25

$10

10NEAR AI Cloud

$1.25

$10

11OrcaRouter

$1.25

$10

12Merge Gateway

$1.25

$10

13Neon

$1.25

$10

比较该模型在不同 API 供应商之间的定价。

外部链接

LLM Stats Artificial Analysis