跳轉到主要內容

gpt-oss-20B (low)

OpenAI

描述

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

發布日期
2025-08-05
參數規模
上下文長度
131K
支援模態
text

能力雷達圖

30
general
58
coding
62
reasoning
40
science估算
50
agents
0
multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域#排名分數來源
程式碼能力榜241
40.0
AA
通用能力榜203
48.0
AA
數學推理142
63.0
AA
科學能力272
41.0
AA

基準測試分數 (LLM Stats)

Biology

GPQA71.5%自報

Communication

TAU-bench Retail54.8%自報

Finance

MMLU85.3%自報

Healthcare

HealthBench42.5%自報
HealthBench Hard10.8%自報

Math

CodeForces0.74 / 3000自報
Humanity's Last Exam10.9%自報

AA 評測指數

Math Index
62.3
Intelligence Index
14.3
Mmlu Pro
0.7
Livecodebench
0.7
Aime 25
0.6
Gpqa
0.6
Ifbench
0.6
Tau2
0.5
Scicode
0.3
Lcr
0.3
Hle
0.1
Terminalbench Hard
0.0

LLM Stats 分類評分

Language
90
Legal
90
Finance
90
General
80
Physics
70
Biology
70
Chemistry
70
Math
60
Reasoning
60
Healthcare
50
Communication
50
Tool Calling
50
Vision
10

定價

輸入價格$0.06 / 1M tokens
輸出價格$0.2 / 1M tokens
混合價格(3:1)$0.095 / 1M tokens

速度

Tokens/秒265.4
首Token延遲0.50s
首回答延遲8.04s

供應商價格排行

供應商價格排行

13 個供應商

最便宜: OpenRouter最貴: Groq
供應商輸入輸出
1OpenRouter最便宜
$0.029
$0.14
2IO.NET
$0.03
$0.14
3Deep Infra
$0.03
$0.14
4Kilo Gateway
$0.03
$0.14
5NanoGPT
$0.04
$0.15
6NovitaAI
$0.04
$0.15
7SiliconFlow
$0.04
$0.18
8Weights & Biases
$0.05
$0.2
9Vercel AI Gateway
$0.05
$0.2
10FastRouter
$0.05
$0.2
11Together AI
$0.05
$0.2
12OpenAI主要
$0.06
$0.2
13Groq
$0.075
$0.3

比較該模型在不同 API 供應商之間的定價。

外部連結