GPT-4.1

OpenAI, GPT, Proprietary

Description

GPT-4.1 is OpenAI's latest and most advanced flagship model, significantly improving upon GPT-4 Turbo in performance across benchmarks, speed, and cost-effectiveness.

Release date: 2025-04-14
Parameter count: (not listed)
Context length: 1.0M
Supported modalities: image, pdf, text

Capability radar chart

General: 36
Coding: 44
Reasoning: 49
Science (estimated): 44
Agents: 60
Multimodal: 85

When no dedicated science evaluation is available, the Science score is estimated using reasoning ability as a proxy.

Leaderboard rankings

Domain: rank, score (source)
Coding: #177, 51.0 (AA)
General: #206, 48.0 (AA)
Math reasoning: #188, 48.0 (AA)
Multimodal: #58, 74.0 (LS)
Reasoning: #67, 60.0 (LS)
Science: #227, 46.0 (AA)

Benchmark scores (LLM Stats)

Biology

GPQA: 66.3% (self-reported)

Code

SWE-Bench Verified: 54.6% (self-reported)
Aider-Polyglot Edit: 52.9% (self-reported)
Aider-Polyglot: 51.6% (self-reported)

Communication

Multi-IF: 70.8% (self-reported)
TAU-bench Retail: 68.0% (self-reported)
TAU-bench Airline: 49.4% (self-reported)
Multi-Challenge: 38.3% (self-reported)

Finance

MMLU: 90.2% (self-reported)

General

IFEval: 87.4% (self-reported)
MMMLU: 87.3% (self-reported)
MMMU: 74.8% (self-reported)
Internal API instruction following (hard): 49.1% (self-reported)

Language

COLLIE: 65.8% (self-reported)

Long Context

ComplexFuncBench: 65.5% (self-reported)
OpenAI-MRCR: 2 needle 128k: 57.2% (self-reported)
OpenAI-MRCR: 2 needle 1M: 46.3% (self-reported)
Graphwalks parents >128k: 25.0% (self-reported)
Graphwalks BFS >128k: 19.0% (self-reported)

Math

MathVista: 72.2% (self-reported)
AIME 2024: 48.1% (self-reported)
AIME 2025: 46.4% (self-reported)
HMMT 2025: 28.9% (self-reported)
Humanity's Last Exam: 5.4% (self-reported)

Multimodal

CharXiv-D: 87.9% (self-reported)
Video-MME (long, no subtitles): 72.0% (self-reported)
CharXiv-R: 56.7% (self-reported)

Reasoning

Graphwalks BFS <128k: 61.7% (self-reported)
Graphwalks parents <128k: 58.0% (self-reported)

AA evaluation indices

Math Index: 34.7
Intelligence Index: 19.4
MATH-500: 0.9
MMLU-Pro: 0.8
GPQA: 0.7
LCR: 0.6
Tau2: 0.5
LiveCodeBench: 0.5
AIME: 0.4
IFBench: 0.4
SciCode: 0.4
AIME 25: 0.3
Terminal-Bench Hard: 0.1
HLE: 0.0

LLM Stats category scores

Legal: 90
Finance: 90
Instruction Following: 80
Language: 80
Healthcare: 80
Multimodal: 70
Physics: 70
Structured Output: 70
General: 70
Biology: 70
Chemistry: 70
Writing: 70
Reasoning: 60
Communication: 60
Tool Calling: 60
Vision: 60
Math: 50
Frontend Development: 50
Code: 50
Long Context: 40
Spatial Reasoning: 40

Pricing

Input price: $2 / 1M tokens
Output price: $8 / 1M tokens
Blended price (3:1): $3.5 / 1M tokens
Cache read price: $0.5 / 1M tokens
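
The blended figure listed here is consistent with a token-weighted average of the input and output prices. The sketch below reproduces that arithmetic, assuming "3:1" means three input tokens for every output token. The function name `blended_price` is hypothetical and is not part of any provider's API.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: float = 3.0, output_parts: float = 1.0) -> float:
    """Return the token-weighted average price per 1M tokens.

    Assumption: the ratio is input tokens to output tokens (3:1 by default).
    """
    total_parts = input_parts + output_parts
    if total_parts <= 0:
        raise ValueError("ratio parts must sum to a positive number")
    return (input_price * input_parts + output_price * output_parts) / total_parts


# Listed prices: $2 input and $8 output per 1M tokens.
print(blended_price(2.0, 8.0))  # 3.5
```

With the listed prices this gives (3 × 2 + 1 × 8) / 4 = 3.5, matching the blended price shown. A different workload mix, for example an output-heavy 1:1 ratio, can be checked by changing the two ratio arguments.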

Speed

Tokens/second: 146.3
Time to first token: 0.59s
Time to first answer: 0.59s
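
One rough way to read the throughput and latency figures together is to estimate total response time as the time to first token plus the output length divided by throughput. The sketch below assumes throughput is constant and that the tokens/second figure describes output generation after the first token. Neither assumption is stated by the source, and `estimated_response_seconds` is a hypothetical helper.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 146.3,
                               ttft_seconds: float = 0.59) -> float:
    """Estimate wall-clock seconds to receive a full response.

    Assumption: constant output throughput once the first token arrives.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_seconds + output_tokens / tokens_per_second


# Estimate for a 500-token answer using the figures listed on this page.
print(round(estimated_response_seconds(500), 2))
```

Real latency varies with provider, load, and prompt length, so this is only a back-of-the-envelope estimate.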

Provider price ranking

20 providers

Cheapest: OpenAI. Most expensive: Cortecs.
Rank. Provider: input price, output price
1. OpenAI (cheapest): $0, $0.00001
2. Poe: $1.8, $7.2
3. 302.AI: $2, $8
4. NanoGPT: $2, $8
5. Abacus: $2, $8
6. OpenRouter: $2, $8
7. Kilo Gateway: $2, $8
8. SAP AI Core: $2, $8
9. GitHub Copilot: $2, $8
10. Helicone: $2, $8
11. Azure Cognitive Services: $2, $8
12. Requesty: $2, $8
13. Vercel AI Gateway: $2, $8
14. LLM Gateway: $2, $8
15. Azure: $2, $8
16. FastRouter: $2, $8
17. NEAR AI Cloud: $2, $8
18. OrcaRouter: $2, $8
19. Merge Gateway: $2, $8
20. Cortecs: $2.354, $9.417

Compare this model's pricing across different API providers.

External links