GPT-5.1 (high)
OpenAIGPTProprietary
Description
The best model for coding and agentic tasks with configurable reasoning effort. GPT-5.1 is our flagship model for coding and agentic tasks with configurable reasoning and non-reasoning effort.
Release Date
2025-11-13
Parameters
—
Context Length
400K
Modalities
file, image, text
Capability Radar
56
general
59
coding
93
reasoning
60
scienceest.
80
agents
90
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 14 | 84.0 | AA |
| General Ranking | 28 | 84.0 | AA |
| Math Reasoning | 17 | 95.0 | AA |
| Reasoning | 8 | 90.0 | LS |
| Science | 36 | 75.0 | AA |
Benchmark Scores (LLM Stats)
Biology
GPQA
88.1%SR
Code
SWE-Bench Verified
76.3%SR
Communication
Tau2 Telecom
95.6%SR
Tau2 Retail
77.9%SR
Tau2 Airline
67.0%SR
General
MMMU
85.4%SR
Math
AIME 2025
94.0%SR
FrontierMath
26.7%SR
Reasoning
BrowseComp Long Context 128k
90.0%SR
AA Evaluation Indices
Math Index94.0
Intelligence Index47.7
Coding Index44.7
Aime 250.9
Gpqa0.9
Mmlu Pro0.9
Livecodebench0.9
Tau20.8
Lcr0.8
Ifbench0.7
Terminalbench Hard0.5
Scicode0.4
Hle0.3
LLM Stats Category Scores
Vision90
Biology90
Chemistry90
General90
Healthcare90
Multimodal90
Physics90
Search90
Tool Calling80
Code80
Communication80
Frontend Development80
Reasoning80
Math60
Pricing
Input Price$1.25 / 1M tokens
Output Price$10 / 1M tokens
Blended Price (3:1)$3.438 / 1M tokens
Speed
Tokens/sec153.0 tokens/s
Time to First Token23.77s
Time to Answer23.77s
Available Providers
(LS internal units)| Provider | Input Price | Output Price |
|---|---|---|
| OpenAI | 1.3M | 10.0M |