GPT-4.1

OpenAI · GPT · Proprietary

Description

GPT-4.1 is OpenAI's latest and most advanced flagship model, significantly improving on GPT-4 Turbo in benchmark performance, speed, and cost-effectiveness.

Release Date: 2025-04-14
Parameters: not listed
Context Length: 1.0M tokens
Modalities: image, PDF, text

Capability Radar

general: 36
coding: 44
reasoning: 49
science (est.): 44
agents: 60
multimodal: 85

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain              Rank  Score  Source
Code Ranking        177   51.0   AA
General Ranking     206   48.0   AA
Math Reasoning      188   48.0   AA
Multimodal Ranking  58    74.0   LS
Reasoning           67    60.0   LS
Science             227   46.0   AA

Benchmark Scores (LLM Stats)

Biology

GPQA  66.3%  SR

Code

SWE-Bench Verified    54.6%  SR
Aider-Polyglot Edit   52.9%  SR
Aider-Polyglot        51.6%  SR

Communication

Multi-IF            70.8%  SR
TAU-bench Retail    68.0%  SR
TAU-bench Airline   49.4%  SR
Multi-Challenge     38.3%  SR

Finance

MMLU  90.2%  SR

General

IFEval                                      87.4%  SR
MMMLU                                       87.3%  SR
MMMU                                        74.8%  SR
Internal API instruction following (hard)   49.1%  SR

Language

COLLIE  65.8%  SR

Long Context

ComplexFuncBench               65.5%  SR
OpenAI-MRCR: 2 needle 128k     57.2%  SR
OpenAI-MRCR: 2 needle 1M       46.3%  SR
Graphwalks parents >128k       25.0%  SR
Graphwalks BFS >128k           19.0%  SR

Math

MathVista              72.2%  SR
AIME 2024              48.1%  SR
AIME 2025              46.4%  SR
HMMT 2025              28.9%  SR
Humanity's Last Exam   5.4%   SR

Multimodal

CharXiv-D                        87.9%  SR
Video-MME (long, no subtitles)   72.0%  SR
CharXiv-R                        56.7%  SR

Reasoning

Graphwalks BFS <128k       61.7%  SR
Graphwalks parents <128k   58.0%  SR

AA Evaluation Indices

Math Index: 34.7
Intelligence Index: 19.4
MATH-500: 0.9
MMLU-Pro: 0.8
GPQA: 0.7
LCR: 0.6
Tau2: 0.5
LiveCodeBench: 0.5
AIME: 0.4
IFBench: 0.4
SciCode: 0.4
AIME 25: 0.3
Terminal-Bench Hard: 0.1
HLE: 0.0

LLM Stats Category Scores

Legal: 90
Finance: 90
Instruction Following: 80
Language: 80
Healthcare: 80
Multimodal: 70
Physics: 70
Structured Output: 70
General: 70
Biology: 70
Chemistry: 70
Writing: 70
Reasoning: 60
Communication: 60
Tool Calling: 60
Vision: 60
Math: 50
Frontend Development: 50
Code: 50
Long Context: 40
Spatial Reasoning: 40

Pricing

Input Price: $2 / 1M tokens
Output Price: $8 / 1M tokens
Blended Price (3:1): $3.5 / 1M tokens
Cache Read Price: $0.5 / 1M tokens
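
The blended figure is consistent with a 3:1 weighted average of the input and output prices, (3 × $2 + 1 × $8) / 4 = $3.5. A minimal Python sketch of that arithmetic follows, plus a per-request cost estimate. The function names are hypothetical, and the assumption that cached input tokens are billed at the cache read price instead of the input price is ours, not something this page states.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per 1M tokens for an input:output token ratio."""
    total_parts = input_parts + output_parts
    return (input_parts * input_price + output_parts * output_price) / total_parts


def request_cost(input_tokens: int, output_tokens: int, cached_tokens: int = 0,
                 input_price: float = 2.0, output_price: float = 8.0,
                 cache_read_price: float = 0.5) -> float:
    """Estimated cost in USD for one request; prices are per 1M tokens.

    Assumption: cached input tokens are charged at cache_read_price
    and the remaining input tokens at input_price.
    """
    if cached_tokens > input_tokens:
        raise ValueError("cached_tokens cannot exceed input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (uncached_tokens * input_price
             + cached_tokens * cache_read_price
             + output_tokens * output_price)
    return total / 1_000_000


print(blended_price(2.0, 8.0))                       # 3:1 blend of the listed prices
print(request_cost(30_000, 10_000))                  # no cache hits
print(request_cost(30_000, 10_000, cached_tokens=20_000))
```

A 3:1 ratio suits input-heavy workloads; for a different mix, pass other values for `input_parts` and `output_parts`.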

Speed

Tokens/sec: 146.3
Time to First Token: 0.59s
Time to Answer: 0.59s
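
As a rough illustration of how these two figures combine, the sketch below estimates the wall-clock time for a full response as time to first token plus output length divided by throughput. This is a simplifying assumption (steady throughput after the first token, no network or queueing variance), and the function name is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_sec: float = 146.3,
                               ttft_seconds: float = 0.59) -> float:
    """Approximate total latency: wait for the first token, then stream the rest.

    Assumes the listed throughput holds constant for the whole response.
    """
    if tokens_per_sec <= 0:
        raise ValueError("tokens_per_sec must be positive")
    return ttft_seconds + output_tokens / tokens_per_sec


# A 500-token answer at the listed speed takes roughly 4 seconds.
print(round(estimated_response_seconds(500), 2))
```

Measured speeds vary by provider and load, so treat the result as an order-of-magnitude estimate.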

Provider Price Ranking

20 providers

Cheapest: OpenAI
Most Expensive: Cortecs
Rank  Provider                   Input    Output
1     OpenAI (Cheapest)          $0       $0.00001
2     Poe                        $1.8     $7.2
3     302.AI                     $2       $8
4     NanoGPT                    $2       $8
5     Abacus                     $2       $8
6     OpenRouter                 $2       $8
7     Kilo Gateway               $2       $8
8     SAP AI Core                $2       $8
9     GitHub Copilot             $2       $8
10    Helicone                   $2       $8
11    Azure Cognitive Services   $2       $8
12    Requesty                   $2       $8
13    Vercel AI Gateway          $2       $8
14    LLM Gateway                $2       $8
15    Azure                      $2       $8
16    FastRouter                 $2       $8
17    NEAR AI Cloud              $2       $8
18    OrcaRouter                 $2       $8
19    Merge Gateway              $2       $8
20    Cortecs                    $2.354   $9.417

Compare pricing across different API providers for this model.

External Sources