gpt-oss-20B (high)
Description
The gpt-oss-120b model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite), and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: although referred to as "20b" for simplicity, gpt-oss-20b technically has 20.9B parameters.
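To put the 16 GB figure in context, the following is a rough back-of-the-envelope sketch of how much memory the weights alone would occupy at different precisions. It uses the 20.9B parameter count from the description. The bit widths (16, 8, 4) and the helper name `weight_memory_gb` are illustrative assumptions, not the model's published configuration, and the estimate ignores activations, the KV cache, and runtime overhead.

```python
def weight_memory_gb(n_params: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes."""
    return n_params * bits_per_weight / 8 / 1e9


N_PARAMS = 20.9e9  # parameter count stated in the description

# Bit widths below are illustrative assumptions, not published settings.
estimates = {bits: weight_memory_gb(N_PARAMS, bits) for bits in (16, 8, 4)}

for bits, gb in estimates.items():
    print(f"{bits:>2}-bit weights: ~{gb:.1f} GB")
```

Under these assumptions, only the roughly 4-bit case leaves the weights comfortably below 16 GB, which suggests that some form of low-bit weight storage is needed for the on-device scenario described above.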
Capability radar
Science uses a reasoning proxy when dedicated science benchmarks are not available.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 196 | 41.0 | AA |
| General Ranking | 147 | 58.0 | AA |
| Math Reasoning | 39 | 90.0 | AA |
| Science | 183 | 49.0 | AA |
Benchmark scores (LLM Stats)
Categories: Biology, Communication, Finance, Healthcare, Math
AA evaluation indices
LLM Stats category scores
Pricing
Speed
Available providers
(Internal LS units)
| Provider | Input price | Output price |
|---|---|---|
| OpenAI | 100K | 500K |
| Fireworks | 100K | 500K |
| Groq | 100K | 500K |