GPT-5.3 Codex (xhigh)
OpenAI · GPT · Proprietary
Description
GPT-5.3-Codex is OpenAI's most capable coding model, combining frontier agentic coding capabilities, improvements in aesthetics, and context compaction. It sets new state-of-the-art results on Terminal-Bench 2.0 (77.3%), OSWorld-Verified (64.7%), and SWE-Lancer IC Diamond (81.4%). It is the first model classified as High capability for cybersecurity under OpenAI's Preparedness Framework. It is available in the Codex app and through the API.
Release Date
2026-02-05
Parameters
—
Context Length
400K
Modalities
file, image, text
Capability Radar
| Dimension | Score |
|---|---|
| General | 51 |
| Coding | 53 |
| Reasoning | 92 |
| Science (est.) | 68 |
| Agents | 80 |
| Multimodal | 85 |
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Agents & Tools | 23 | 67.0 | LS |
| Code Ranking | 6 | 91.0 | AA |
| General Ranking | 13 | 88.0 | AA |
| Science | 7 | 92.0 | AA |
Benchmark Scores (LLM Stats)
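| Category | Benchmark | Score |
|---|---|---|
| Agents | Terminal-Bench 2.0 | 77.3% SR |
| Agents | OSWorld-Verified | 64.7% SR |
| Agents | SWE-Bench Pro | 56.8% SR |
| Code | SWE-Lancer (IC-Diamond subset) | 81.4% SR |
| Safety | Cybersecurity CTFs | 77.6% SR |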
AA Evaluation Indices
| Index | Score |
|---|---|
| Intelligence Index | 53.6 |
| Coding Index | 53.1 |
| GPQA | 0.9 |
| Tau2 | 0.9 |
| IFBench | 0.8 |
| LCR | 0.7 |
| SciCode | 0.5 |
| Terminal-Bench Hard | 0.5 |
| HLE | 0.4 |
LLM Stats Category Scores
| Category | Score |
|---|---|
| Tool Calling | 80 |
| Safety | 80 |
| Agents | 70 |
| Code | 70 |
| Reasoning | 70 |
| Vision | 60 |
| General | 60 |
| Multimodal | 60 |
Pricing
| Item | Price |
|---|---|
| Input Price | $1.75 / 1M tokens |
| Output Price | $14 / 1M tokens |
| Blended Price (3:1) | $4.813 / 1M tokens |
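
The blended figure can be reproduced from the two listed prices. The sketch below assumes that "3:1" means three input tokens for every output token, which is consistent with the listed $4.813 but is not stated on the page. The helper names `blended_price` and `request_cost` are hypothetical and only illustrate the arithmetic.

```python
from decimal import Decimal, ROUND_HALF_UP

# Listed prices in USD per 1M tokens (from the pricing figures above).
INPUT_PRICE = 1.75
OUTPUT_PRICE = 14.0


def blended_price(input_price, output_price, input_parts=3, output_parts=1):
    """Weighted average price per 1M tokens for an input:output token mix.

    Assumption: "3:1" means 3 input tokens for every 1 output token.
    """
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens, output_tokens):
    """Hypothetical helper: USD cost of one request at the listed prices."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000


blended = blended_price(INPUT_PRICE, OUTPUT_PRICE)  # (3 * 1.75 + 14) / 4 = 4.8125

# Round half up to three decimals, which matches the listed $4.813.
blended_rounded = Decimal(str(blended)).quantize(
    Decimal("0.001"), rounding=ROUND_HALF_UP
)

print(f"Blended (3:1): ${blended_rounded} / 1M tokens")
print(f"30K input + 10K output tokens: ${request_cost(30_000, 10_000):.4f}")
```

Workloads with a different input-to-output mix can pass their own `input_parts` and `output_parts`; output-heavy agentic runs will sit closer to the $14 output price than the 3:1 blend suggests.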
Speed
| Metric | Value |
|---|---|
| Tokens/sec | 83.3 tokens/s |
| Time to First Token | 58.63 s |
| Time to Answer | 58.63 s |
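
As a rough guide to what these figures imply for longer outputs, the sketch below adds the listed time to first token to the time needed to stream a given number of tokens at the listed rate. This additive model is an assumption, not something the listing defines, and `estimated_total_seconds` is a hypothetical helper name.

```python
# Listed speed figures (from the speed metrics above).
TTFT_SECONDS = 58.63        # time to first token
TOKENS_PER_SECOND = 83.3    # output throughput


def estimated_total_seconds(output_tokens, ttft=TTFT_SECONDS, rate=TOKENS_PER_SECOND):
    """Estimate wall-clock seconds to receive `output_tokens` tokens.

    Assumption: after the first token arrives, tokens stream at a
    constant `rate`; real throughput varies with load and prompt size.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft + output_tokens / rate


for n in (100, 1_000, 10_000):
    print(f"{n:>6} tokens: ~{estimated_total_seconds(n):.1f} s")
```

Under this model the wait before the first token dominates short responses, so streaming throughput only becomes the main cost for outputs of several thousand tokens.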
Available Providers
(LS internal units)

| Provider | Input Price | Output Price |
|---|---|---|
| OpenAI | 1.8M | 14.0M |