gpt-oss-20B (high)
Description
The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.
Radar de capacités
Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.
Classements
| Domaine | #Rang | Score | Source |
|---|---|---|---|
| Code Ranking | 196 | 41.0 | AA |
| General Ranking | 147 | 58.0 | AA |
| Math Reasoning | 39 | 90.0 | AA |
| Science | 183 | 49.0 | AA |
Scores de benchmarks (LLM Stats)
Biology
Communication
Finance
Healthcare
Math
Indices d'évaluation AA
Scores par catégorie LLM Stats
Tarification
Vitesse
Fournisseurs disponibles
(Unités internes LS)| Fournisseur | Prix d'entrée | Prix de sortie |
|---|---|---|
| OpenAI | 100K | 500K |
| Fireworks | 100K | 500K |
| Groq | 100K | 500K |