gpt-oss-20B (high)

OpenAI · Open Weight · Apache 2.0 · Commercial OK

Description

The gpt-oss-120b model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it well suited to on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: although referred to as '20b' for simplicity, the model has 20.9B parameters.
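To make the 16 GB claim concrete, the following is a rough sketch of how much memory the weights alone would need at different precisions. The 20.9B parameter count comes from this page; the bits-per-parameter values are illustrative assumptions, not a statement of how the released checkpoint is stored, and the estimate ignores the KV cache, activations, and runtime overhead.

```python
PARAMS = 20.9e9  # parameter count stated on this page

def weight_memory_gb(params: float, bits_per_param: float) -> float:
    """Approximate weight storage in GB (10^9 bytes).

    Ignores KV cache, activations, and runtime overhead.
    """
    return params * bits_per_param / 8 / 1e9

# Illustrative precisions (assumptions, not the checkpoint's actual format).
for bits in (16, 8, 4):
    print(f"{bits:>2} bits/param: {weight_memory_gb(PARAMS, bits):.2f} GB")
```

Under these assumptions, 16-bit weights would need roughly 41.8 GB and 4-bit weights roughly 10.45 GB, so fitting in 16 GB implies a low-precision weight format.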

Release Date

2025-08-05

Parameters

20.9B

Context Length

131K

Modalities

text

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	248	39.0	AA
General Ranking	171	53.0	AA
Math Reasoning	39	90.0	AA
Science	201	48.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score
Biology	GPQA	71.5% (SR)
Communication	TAU-bench Retail	54.8% (SR)
Finance	MMLU	85.3% (SR)
Healthcare	HealthBench	42.5% (SR)
Healthcare	HealthBench Hard	10.8% (SR)
Math	CodeForces	0.74 / 3000 (SR)
Math	Humanity's Last Exam	10.9% (SR)

AA Evaluation Indices

Metric	Value
Math Index	89.3
Coding Index	20.7
Intelligence Index	14.9
AIME 25	0.9
LiveCodeBench	0.8
MMLU-Pro	0.7
GPQA	0.7
IFBench	0.7
Tau2	0.6
SciCode	0.3
LCR	0.3
Terminalbench V2 1	0.1
Terminalbench Hard	0.1
HLE	0.1
Tau Banking	0.1

LLM Stats Category Scores

Categories: Language, Legal, Finance, General, Physics, Biology, Chemistry, Math, Reasoning, Healthcare, Communication, Tool Calling, Vision

Pricing

Input Price: $0.05 / 1M tokens

Output Price: $0.2 / 1M tokens

Blended Price (3:1): $0.088 / 1M tokens
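The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token, weighted by price; that reading is an assumption, though it does reproduce the number shown on this page.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens for every 1 output token (an assumption
    about what "3:1" means on this page).
    """
    return (ratio * input_price + output_price) / (ratio + 1)

# Prices per 1M tokens taken from the Pricing section.
print(round(blended_price(0.05, 0.20), 4))
```

This gives 0.0875, which the page shows rounded to $0.088.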

Speed

Tokens/sec: 233.2

Time to First Token: 0.66s

Time to Answer: 9.23s
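As a rough way to relate these three figures, the sketch below assumes a simple latency model: total time equals time to first token plus output tokens divided by throughput. The model and the function names are illustrative assumptions; the page does not state how Time to Answer is measured.

```python
def estimated_time_s(output_tokens: float, ttft_s: float = 0.66,
                     tokens_per_s: float = 233.2) -> float:
    """Estimated wall-clock time to stream `output_tokens` tokens."""
    return ttft_s + output_tokens / tokens_per_s

def implied_tokens(total_s: float, ttft_s: float = 0.66,
                   tokens_per_s: float = 233.2) -> float:
    """Tokens generated within `total_s` seconds under the same model."""
    return (total_s - ttft_s) * tokens_per_s

print(f"{implied_tokens(9.23):.0f} tokens before the answer completes")
print(f"{estimated_time_s(500):.2f} s for a 500-token response")
```

Under this model, the 9.23 s Time to Answer corresponds to roughly 2,000 generated tokens, which would be consistent with a reasoning model producing a long chain of thought before its final answer.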

Provider Price Ranking

16 providers

Cheapest: LLM Gateway · Most Expensive: Regolo AI

Rank	Provider	Input ($ / 1M tokens)	Output ($ / 1M tokens)
1	LLM Gateway (Cheapest)	$0.04	$0.15
2	Clarifai	$0.045	$0.18
3	Helicone	$0.05	$0.2
4	OpenAI (Primary)	$0.05	$0.2
5	DigitalOcean	$0.05	$0.45
6	OVHcloud AI Endpoints	$0.05	$0.18
7	Databricks	$0.05	$0.2
8	Neon	$0.05	$0.2
9	Fireworks AI	$0.07	$0.3
10	Amazon Bedrock	$0.07	$0.3
11	FrogBot	$0.07	$0.2
12	Vertex	$0.07	$0.25
13	NanoGPT	$0.2	$0.8
14	Cloudflare AI Gateway	$0.2	$0.3
15	Cloudflare Workers AI	$0.2	$0.3
16	Regolo AI	$0.4	$1.8
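Because the listed rank does not always follow output price (DigitalOcean sits above OVHcloud AI Endpoints despite a higher output price), it can help to compare providers on a concrete workload. The sketch below uses a handful of rows from the listing and a hypothetical workload of 3M input and 1M output tokens; it assumes the listed prices are USD per 1M tokens, as in the Pricing section.

```python
# (input, output) prices, assumed to be USD per 1M tokens; subset of the listing.
PROVIDERS = {
    "LLM Gateway": (0.04, 0.15),
    "OpenAI": (0.05, 0.20),
    "DigitalOcean": (0.05, 0.45),
    "OVHcloud AI Endpoints": (0.05, 0.18),
    "Regolo AI": (0.40, 1.80),
}

def workload_cost(prices: tuple[float, float], input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for the given token counts."""
    input_price, output_price = prices
    return input_tokens / 1e6 * input_price + output_tokens / 1e6 * output_price

# Hypothetical workload: 3M input tokens, 1M output tokens.
ranked = sorted(PROVIDERS, key=lambda name: workload_cost(PROVIDERS[name], 3_000_000, 1_000_000))
for name in ranked:
    print(f"{name}: ${workload_cost(PROVIDERS[name], 3_000_000, 1_000_000):.2f}")
```

For this workload the order is LLM Gateway, OVHcloud AI Endpoints, OpenAI, DigitalOcean, Regolo AI; a different input-to-output mix can change the order among mid-priced providers.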

Compare pricing across different API providers for this model.

External Sources

LLM Stats · Artificial Analysis