gpt-oss-20B (low)

OpenAI

Description

The gpt-oss-120b model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks while running efficiently on a single 80 GB GPU. The gpt-oss-20b model (technically 20.9B parameters, referred to as "20b" for simplicity) delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it well suited to on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o).
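As a rough sanity check on the 16 GB figure, the weight footprint can be estimated from the parameter count and the number of bits stored per parameter. The sketch below is back-of-the-envelope arithmetic only: the bit widths are illustrative assumptions (this page does not state the storage format), and the estimate ignores activations, the KV cache, and runtime overhead.

```python
def weight_footprint_gb(n_params: float, bits_per_param: float) -> float:
    """Approximate size of the weights alone, in decimal gigabytes."""
    return n_params * bits_per_param / 8 / 1e9


N_PARAMS = 20.9e9  # parameter count quoted in the description

# Hypothetical storage widths, chosen for illustration (not taken from this page).
for bits in (16, 8, 4):
    size_gb = weight_footprint_gb(N_PARAMS, bits)
    print(f"{bits:>2} bits/param: {size_gb:.2f} GB")
```

Under these assumptions, only the roughly 4-bit case (about 10.45 GB of weights) leaves headroom within a 16 GB budget.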

Release Date

2025-08-05

Parameters

20.9B

Context Length

131K

Modalities

text

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	241	40.0	AA
General Ranking	203	48.0	AA
Math Reasoning	142	63.0	AA
Science	272	41.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score
Biology	GPQA	71.5% (SR)
Communication	TAU-bench Retail	54.8% (SR)
Finance	MMLU	85.3% (SR)
Healthcare	HealthBench	42.5% (SR)
Healthcare	HealthBench Hard	10.8% (SR)
Math	CodeForces	0.74 / 3000 (SR)
Math	Humanity's Last Exam	10.9% (SR)

AA Evaluation Indices

Index	Score
Math Index	62.3
Intelligence Index	14.3
MMLU-Pro	0.7
LiveCodeBench	0.7
AIME 25	0.6
GPQA	0.6
IFBench	0.6
Tau2	0.5
SciCode	0.3
LCR	0.3
HLE	0.1
Terminal-Bench Hard	0.0

LLM Stats Category Scores

Categories: Language, Legal, Finance, General, Physics, Biology, Chemistry, Math, Reasoning, Healthcare, Communication, Tool Calling, Vision

Pricing

Input Price: $0.06 / 1M tokens

Output Price: $0.2 / 1M tokens

Blended Price (3:1): $0.095 / 1M tokens
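The blended figure is consistent with a weighted average of the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token; the function name and default weights are illustrative, not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens for a given input:output token mix."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices in USD per 1M tokens, as listed above.
price = blended_price(input_price=0.06, output_price=0.20)
print(f"${price:.3f} / 1M tokens")  # (3 * 0.06 + 1 * 0.20) / 4 = 0.095
```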

Speed

Tokens/sec: 265.4

Time to First Token: 0.50s

Time to Answer: 8.04s
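To get a feel for what these numbers mean for a given response length, one simple latency model adds the time to first token to the time needed to stream the remaining output at the listed throughput. This is a simplifying assumption made for illustration: the page does not define how Time to Answer is measured, so the sketch does not attempt to reproduce the 8.04s figure.

```python
def estimated_response_time(n_output_tokens: int, ttft_s: float,
                            tokens_per_s: float) -> float:
    """Simple model: wait for the first token, then stream at a constant rate."""
    return ttft_s + n_output_tokens / tokens_per_s


TTFT_S = 0.50         # Time to First Token listed above
TOKENS_PER_S = 265.4  # Tokens/sec listed above

# Hypothetical response lengths, chosen for illustration.
for n_tokens in (100, 500, 2000):
    seconds = estimated_response_time(n_tokens, TTFT_S, TOKENS_PER_S)
    print(f"{n_tokens:>5} output tokens: about {seconds:.2f}s")
```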

Provider Price Ranking

13 providers

Cheapest: OpenRouter

Most Expensive: Groq

Rank	Provider	Input ($ / 1M tokens)	Output ($ / 1M tokens)
1	OpenRouter (cheapest)	$0.029	$0.14
2	IO.NET	$0.03	$0.14
3	Deep Infra	$0.03	$0.14
4	Kilo Gateway	$0.03	$0.14
5	NanoGPT	$0.04	$0.15
6	NovitaAI	$0.04	$0.15
7	SiliconFlow	$0.04	$0.18
8	Weights & Biases	$0.05	$0.2
9	Vercel AI Gateway	$0.05	$0.2
10	FastRouter	$0.05	$0.2
11	Together AI	$0.05	$0.2
12	OpenAI (primary)	$0.06	$0.2
13	Groq	$0.075	$0.3

Compare pricing across different API providers for this model.
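One way to compare providers is to cost out a concrete workload at each provider's rates. The sketch below uses the prices from the provider list above and assumes they are quoted in USD per 1M tokens, matching the Pricing section; the workload size is hypothetical.

```python
# (provider, input price, output price), assumed to be USD per 1M tokens.
PROVIDERS = [
    ("OpenRouter", 0.029, 0.14),
    ("IO.NET", 0.03, 0.14),
    ("Deep Infra", 0.03, 0.14),
    ("Kilo Gateway", 0.03, 0.14),
    ("NanoGPT", 0.04, 0.15),
    ("NovitaAI", 0.04, 0.15),
    ("SiliconFlow", 0.04, 0.18),
    ("Weights & Biases", 0.05, 0.2),
    ("Vercel AI Gateway", 0.05, 0.2),
    ("FastRouter", 0.05, 0.2),
    ("Together AI", 0.05, 0.2),
    ("OpenAI", 0.06, 0.2),
    ("Groq", 0.075, 0.3),
]


def workload_cost(input_price: float, output_price: float,
                  input_tokens: int, output_tokens: int) -> float:
    """Total cost in USD for a workload, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


# Hypothetical workload: 30M input tokens and 10M output tokens.
INPUT_TOKENS = 30_000_000
OUTPUT_TOKENS = 10_000_000

costs = sorted(
    (workload_cost(inp, out, INPUT_TOKENS, OUTPUT_TOKENS), name)
    for name, inp, out in PROVIDERS
)
for cost, name in costs:
    print(f"{name:<20} ${cost:.2f}")
```

Price is only one axis of comparison; throughput and latency can differ between providers, and this page reports speed figures for the model as a whole rather than per provider.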

External Sources

Artificial Analysis