Skip to main content

gpt-oss-20B (low)

OpenAI

Description

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

Release Date
2025-08-05
Parameters
Context Length
131K
Modalities
text

Capability Radar

30
general
58
coding
62
reasoning
40
scienceest.
50
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking241
40.0
AA
General Ranking203
48.0
AA
Math Reasoning142
63.0
AA
Science272
41.0
AA

Benchmark Scores (LLM Stats)

Biology

GPQA71.5%SR

Communication

TAU-bench Retail54.8%SR

Finance

MMLU85.3%SR

Healthcare

HealthBench42.5%SR
HealthBench Hard10.8%SR

Math

CodeForces0.74 / 3000SR
Humanity's Last Exam10.9%SR

AA Evaluation Indices

Math Index
62.3
Intelligence Index
14.3
Mmlu Pro
0.7
Livecodebench
0.7
Aime 25
0.6
Gpqa
0.6
Ifbench
0.6
Tau2
0.5
Scicode
0.3
Lcr
0.3
Hle
0.1
Terminalbench Hard
0.0

LLM Stats Category Scores

Language
90
Legal
90
Finance
90
General
80
Physics
70
Biology
70
Chemistry
70
Math
60
Reasoning
60
Healthcare
50
Communication
50
Tool Calling
50
Vision
10

Pricing

Input Price$0.06 / 1M tokens
Output Price$0.2 / 1M tokens
Blended Price (3:1)$0.095 / 1M tokens

Speed

Tokens/sec265.4
Time to First Token0.50s
Time to Answer8.04s

Provider Price Ranking

Provider Price Ranking

13 providers

Cheapest: OpenRouterMost Expensive: Groq
ProviderInputOutput
1OpenRouterCheapest
$0.029
$0.14
2IO.NET
$0.03
$0.14
3Deep Infra
$0.03
$0.14
4Kilo Gateway
$0.03
$0.14
5NanoGPT
$0.04
$0.15
6NovitaAI
$0.04
$0.15
7SiliconFlow
$0.04
$0.18
8Weights & Biases
$0.05
$0.2
9Vercel AI Gateway
$0.05
$0.2
10FastRouter
$0.05
$0.2
11Together AI
$0.05
$0.2
12OpenAIPRIMARY
$0.06
$0.2
13Groq
$0.075
$0.3

Compare pricing across different API providers for this model.

External Sources