GPT-4.1 mini

OpenAIGPTProprietary

Description

GPT-4.1 mini provides a balance between intelligence, speed, and cost. It's a significant leap in small model performance, even beating GPT-4o in many benchmarks while reducing latency and cost.

Release Date

2025-04-14

Parameters

—

Context Length

1.0M

Modalities

image, pdf, text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	244	40.0	AA
General Ranking	222	46.0	AA
Math Reasoning	160	56.0	AA
Multimodal Ranking	54	75.0	LS
Reasoning	65	62.0	LS
Science	215	47.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

65.0%SR

Code

Aider-Polyglot

34.7%SR

Aider-Polyglot Edit

31.6%SR

SWE-Bench Verified

23.6%SR

Communication

Multi-IF

67.0%SR

TAU-bench Retail

55.8%SR

TAU-bench Airline

36.0%SR

Multi-Challenge

35.8%SR

Finance

MMLU

87.5%SR

General

IFEval

84.1%SR

MMMLU

78.5%SR

MMMU

72.7%SR

Internal API instruction following (hard)

45.1%SR

Language

COLLIE

54.6%SR

Long Context

ComplexFuncBench

49.3%SR

OpenAI-MRCR: 2 needle 128k

47.2%SR

OpenAI-MRCR: 2 needle 1M

33.3%SR

Graphwalks BFS >128k

15.0%SR

Graphwalks parents >128k

11.0%SR

Math

MathVista

73.1%SR

AIME 2024

49.6%SR

AIME 2025

40.2%SR

HMMT 2025

35.0%SR

Humanity's Last Exam

3.7%SR

Multimodal

CharXiv-D

88.4%SR

CharXiv-R

56.8%SR

Reasoning

Graphwalks BFS <128k

61.7%SR

Graphwalks parents <128k

60.5%SR

AA Evaluation Indices

Math Index

46.3

Intelligence Index

16.3

Math 500

0.9

Mmlu Pro

0.8

Gpqa

0.7

Tau2

0.5

Livecodebench

0.5

Aime 25

0.5

Aime

0.4

Lcr

0.4

Scicode

0.4

Ifbench

0.4

Terminalbench Hard

0.1

Hle

0.0

LLM Stats Category Scores

Legal

Finance

Instruction Following

Healthcare

Language

Multimodal

Physics

Structured Output

Biology

Chemistry

General

Vision

Math

Reasoning

Communication

Tool Calling

Writing

Spatial Reasoning

Long Context

Code

Frontend Development

Pricing

Input Price$0.4 / 1M tokens

Output Price$1.6 / 1M tokens

Blended Price (3:1)$0.7 / 1M tokens

Cache Read Price$0.1 / 1M tokens

Speed

Tokens/sec98.8

Time to First Token0.52s

Time to Answer0.52s

Provider Price Ranking

17 providers

Cheapest: OpenAIMost Expensive: Merge Gateway

ProviderInputOutput

1OpenAICheapest

2Poe

$0.36

$1.4

3Helicone

$0.4

$1.6

4302.AI

$0.4

$1.6

5NanoGPT

$0.4

$1.6

6Abacus

$0.4

$1.6

7OpenRouter

$0.4

$1.6

8Kilo Gateway

$0.4

$1.6

9SAP AI Core

$0.4

$1.6

10Azure Cognitive Services

$0.4

$1.6

11Requesty

$0.4

$1.6

12Vercel AI Gateway

$0.4

$1.6

13LLM Gateway

$0.4

$1.6

14Azure

$0.4

$1.6

15NEAR AI Cloud

$0.4

$1.6

16OrcaRouter

$0.4

$1.6

17Merge Gateway

$0.4

$1.6

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis