跳转到主要内容

GPT-4.1 nano

OpenAIGPTProprietary

描述

GPT-4.1 nano is OpenAI's fastest and cheapest model available in the GPT-4.1 family. It delivers exceptional performance at a small size with its 1 million token context window. Ideal for tasks like classification or autocompletion.

发布日期
2025-04-14
参数规模
上下文长度
1.0M
支持模态
file, image, text

能力雷达图

27
general
20
coding
36
reasoning
33
science估算
10
agents
85
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
代码能力榜341
20.0
AA
通用能力榜375
29.0
AA
数学推理242
36.0
AA
多模态榜69
60.0
LS
推理能力102
17.0
LS
科学能力329
33.0
AA

基准测试分数 (LLM Stats)

Biology

GPQA50.3%自报

Code

Aider-Polyglot9.8%自报
Aider-Polyglot Edit6.2%自报

Communication

Multi-IF57.2%自报
TAU-bench Retail22.6%自报
Multi-Challenge15.0%自报
TAU-bench Airline14.0%自报

Finance

MMLU80.1%自报

General

IFEval74.5%自报
MMMLU66.9%自报
MMMU55.4%自报
Internal API instruction following (hard)31.6%自报

Language

COLLIE42.5%自报

Long Context

OpenAI-MRCR: 2 needle 128k36.6%自报
OpenAI-MRCR: 2 needle 1M12.0%自报
ComplexFuncBench5.7%自报
Graphwalks parents >128k5.6%自报
Graphwalks BFS >128k2.9%自报

Math

MathVista56.2%自报
AIME 202429.4%自报

Multimodal

CharXiv-D73.9%自报
CharXiv-R40.5%自报

Reasoning

Graphwalks BFS <128k25.0%自报
Graphwalks parents <128k9.4%自报

AA 评测指数

Math Index
24.0
Intelligence Index
13.0
Coding Index
11.2
Math 500
0.8
Mmlu Pro
0.7
Gpqa
0.5
Livecodebench
0.3
Ifbench
0.3
Scicode
0.3
Aime 25
0.2
Aime
0.2
Tau2
0.2
Lcr
0.2
Hle
0.0
Terminalbench Hard
0.0

LLM Stats 分类评分

Finance
80
Legal
80
Healthcare
70
Instruction Following
70
Vision
60
Language
60
Math
60
Multimodal
60
Structured Output
50
Biology
50
Chemistry
50
General
50
Physics
50
Writing
40
Communication
30
Reasoning
30
Spatial Reasoning
10
Tool Calling
10
Code
10
Long Context
10

定价

输入价格$0.1 / 1M tokens
输出价格$0.4 / 1M tokens
混合价格(3:1)$0.175 / 1M tokens

速度

Tokens/秒121.7 tokens/s
首Token延迟0.52s
首回答延迟0.52s

可用提供商

(LS 内部计价单位)
提供商输入价格输出价格
OpenAI100K400K

外部链接