Passer au contenu principal

gpt-oss-20B (high)

OpenAIOpen WeightApache 2.0 · Usage Commercial

Description

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

Date de sortie
2025-08-05
Paramètres
20.9B
Longueur du contexte
131K
Modalités
text

Radar de capacités

32
general
42
coding
86
reasoning
45
scienceest.
50
agents
0
multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine#RangScoreSource
Classement codage248
39.0
AA
Classement général171
53.0
AA
Raisonnement mathématique39
90.0
AA
Science201
48.0
AA

Scores de benchmarks (LLM Stats)

Biology

GPQA71.5%Aut.

Communication

TAU-bench Retail54.8%Aut.

Finance

MMLU85.3%Aut.

Healthcare

HealthBench42.5%Aut.
HealthBench Hard10.8%Aut.

Math

CodeForces0.74 / 3000Aut.
Humanity's Last Exam10.9%Aut.

Indices d'évaluation AA

Math Index
89.3
Coding Index
20.7
Intelligence Index
14.9
Aime 25
0.9
Livecodebench
0.8
Mmlu Pro
0.7
Gpqa
0.7
Ifbench
0.7
Tau2
0.6
Scicode
0.3
Lcr
0.3
Terminalbench V2 1
0.1
Terminalbench Hard
0.1
Hle
0.1
Tau Banking
0.1

Scores par catégorie LLM Stats

Language
90
Legal
90
Finance
90
General
80
Physics
70
Biology
70
Chemistry
70
Math
60
Reasoning
60
Healthcare
50
Communication
50
Tool Calling
50
Vision
10

Tarification

Prix d'entrée$0.05 / 1M tokens
Prix de sortie$0.2 / 1M tokens
Prix mixte (3:1)$0.088 / 1M tokens

Vitesse

Tokens/sec233.2
Délai du premier token0.66s
Temps de réponse9.23s

Classement des Prix par Fournisseur

Classement des Prix par Fournisseur

16 fournisseurs

Moins cher: LLM GatewayPlus cher: Regolo AI
FournisseurEntréeSortie
1LLM GatewayMoins cher
$0.04
$0.15
2Clarifai
$0.045
$0.18
3Helicone
$0.05
$0.2
4OpenAIPRINCIPAL
$0.05
$0.2
5DigitalOcean
$0.05
$0.45
6OVHcloud AI Endpoints
$0.05
$0.18
7Databricks
$0.05
$0.2
8Neon
$0.05
$0.2
9Fireworks AI
$0.07
$0.3
10Amazon Bedrock
$0.07
$0.3
11FrogBot
$0.07
$0.2
12Vertex
$0.07
$0.25
13NanoGPT
$0.2
$0.8
14Cloudflare AI Gateway
$0.2
$0.3
15Cloudflare Workers AI
$0.2
$0.3
16Regolo AI
$0.4
$1.8

Comparer les prix entre différents fournisseurs API pour ce modèle.

Sources externes