Skip to main content

GPT-5.1 (high)

OpenAIGPTProprietary

Description

The best model for coding and agentic tasks with configurable reasoning effort. GPT-5.1 is our flagship model for coding and agentic tasks with configurable reasoning and non-reasoning effort.

Release Date
2025-11-13
Parameters
Context Length
400K
Modalities
file, image, text

Capability Radar

56
general
59
coding
93
reasoning
60
scienceest.
80
agents
90
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking14
84.0
AA
General Ranking28
84.0
AA
Math Reasoning17
95.0
AA
Reasoning8
90.0
LS
Science36
75.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA88.1%SR

Code

SWE-Bench Verified76.3%SR

Communication

Tau2 Telecom95.6%SR
Tau2 Retail77.9%SR
Tau2 Airline67.0%SR

General

MMMU85.4%SR

Math

AIME 202594.0%SR
FrontierMath26.7%SR

Reasoning

BrowseComp Long Context 128k90.0%SR

AA Evaluation Indices

Math Index
94.0
Intelligence Index
47.7
Coding Index
44.7
Aime 25
0.9
Gpqa
0.9
Mmlu Pro
0.9
Livecodebench
0.9
Tau2
0.8
Lcr
0.8
Ifbench
0.7
Terminalbench Hard
0.5
Scicode
0.4
Hle
0.3

LLM Stats Category Scores

Vision
90
Biology
90
Chemistry
90
General
90
Healthcare
90
Multimodal
90
Physics
90
Search
90
Tool Calling
80
Code
80
Communication
80
Frontend Development
80
Reasoning
80
Math
60

Pricing

Input Price$1.25 / 1M tokens
Output Price$10 / 1M tokens
Blended Price (3:1)$3.438 / 1M tokens

Speed

Tokens/sec153.0 tokens/s
Time to First Token23.77s
Time to Answer23.77s

Available Providers

(LS internal units)
ProviderInput PriceOutput Price
OpenAI1.3M10.0M

External Sources