GPT-4.1

OpenAIGPTProprietary

Descripción

GPT-4.1 is OpenAI's latest and most advanced flagship model, significantly improving upon GPT-4 Turbo in performance across benchmarks, speed, and cost-effectiveness.

Fecha de lanzamiento

2025-04-14

Parámetros

—

Longitud del contexto

1.0M

Modalidades

image, pdf, text

Radar de capacidades

general

coding

reasoning

scienceest.

agents

multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio	#Posición	Puntuación	Fuente
Ranking de codificación	177	51.0	AA
Ranking general	206	48.0	AA
Razonamiento matemático	188	48.0	AA
Ranking multimodal	58	74.0	LS
Razonamiento	67	60.0	LS
Ciencia	227	46.0	AA

Puntuaciones de benchmarks (LLM Stats)

Biology

GPQA

66.3%Aut.

Code

SWE-Bench Verified

54.6%Aut.

Aider-Polyglot Edit

52.9%Aut.

Aider-Polyglot

51.6%Aut.

Communication

Multi-IF

70.8%Aut.

TAU-bench Retail

68.0%Aut.

TAU-bench Airline

49.4%Aut.

Multi-Challenge

38.3%Aut.

Finance

MMLU

90.2%Aut.

General

IFEval

87.4%Aut.

MMMLU

87.3%Aut.

MMMU

74.8%Aut.

Internal API instruction following (hard)

49.1%Aut.

Language

COLLIE

65.8%Aut.

Long Context

ComplexFuncBench

65.5%Aut.

OpenAI-MRCR: 2 needle 128k

57.2%Aut.

OpenAI-MRCR: 2 needle 1M

46.3%Aut.

Graphwalks parents >128k

25.0%Aut.

Graphwalks BFS >128k

19.0%Aut.

Math

MathVista

72.2%Aut.

AIME 2024

48.1%Aut.

AIME 2025

46.4%Aut.

HMMT 2025

28.9%Aut.

Humanity's Last Exam

5.4%Aut.

Multimodal

CharXiv-D

87.9%Aut.

Video-MME (long, no subtitles)

72.0%Aut.

CharXiv-R

56.7%Aut.

Reasoning

Graphwalks BFS <128k

61.7%Aut.

Graphwalks parents <128k

58.0%Aut.

Índices de evaluación AA

Math Index

34.7

Intelligence Index

19.4

Math 500

0.9

Mmlu Pro

0.8

Gpqa

0.7

Lcr

0.6

Tau2

0.5

Livecodebench

0.5

Aime

0.4

Ifbench

0.4

Scicode

0.4

Aime 25

0.3

Terminalbench Hard

0.1

Hle

0.0

Puntuaciones por categoría LLM Stats

Legal

Finance

Instruction Following

Language

Healthcare

Multimodal

Physics

Structured Output

General

Biology

Chemistry

Writing

Reasoning

Communication

Tool Calling

Vision

Math

Frontend Development

Code

Long Context

Spatial Reasoning

Precios

Precio de entrada$2 / 1M tokens

Precio de salida$8 / 1M tokens

Precio mixto (3:1)$3.5 / 1M tokens

Precio de lectura caché$0.5 / 1M tokens

Velocidad

Tokens/seg146.3

Retraso del primer token0.59s

Tiempo hasta la respuesta0.59s

Ranking de Precios por Proveedor

20 proveedores

Más barato: OpenAIMás caro: Cortecs

ProveedorEntradaSalida

1OpenAIMás barato

$0.00001

2Poe

$1.8

$7.2

3302.AI

4NanoGPT

5Abacus

6OpenRouter

7Kilo Gateway

8SAP AI Core

9GitHub Copilot

10Helicone

11Azure Cognitive Services

12Requesty

13Vercel AI Gateway

14LLM Gateway

15Azure

16FastRouter

17NEAR AI Cloud

18OrcaRouter

19Merge Gateway

20Cortecs

$2.354

$9.417

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas

LLM Stats Artificial Analysis