gpt-oss-20B (high)

OpenAI開源權重Apache 2.0 · 商用許可

描述

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

發布日期

2025-08-05

參數規模

20.9B

上下文長度

131K

支援模態

text

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
程式碼能力榜	248	39.0	AA
通用能力榜	171	53.0	AA
數學推理	39	90.0	AA
科學能力	201	48.0	AA

基準測試分數 (LLM Stats)

Biology

GPQA

71.5%自報

Communication

TAU-bench Retail

54.8%自報

Finance

MMLU

85.3%自報

Healthcare

HealthBench

42.5%自報

HealthBench Hard

10.8%自報

Math

CodeForces

0.74 / 3000自報

Humanity's Last Exam

10.9%自報

AA 評測指數

Math Index

89.3

Coding Index

20.7

Intelligence Index

14.9

Aime 25

0.9

Livecodebench

0.8

Mmlu Pro

0.7

Gpqa

0.7

Ifbench

0.7

Tau2

0.6

Scicode

0.3

Lcr

0.3

Terminalbench V2 1

0.1

Terminalbench Hard

0.1

Hle

0.1

Tau Banking

0.1

LLM Stats 分類評分

Language

Legal

Finance

General

Physics

Biology

Chemistry

Math

Reasoning

Healthcare

Communication

Tool Calling

Vision

定價

輸入價格$0.05 / 1M tokens

輸出價格$0.2 / 1M tokens

混合價格(3:1)$0.088 / 1M tokens

速度

Tokens/秒233.2

首Token延遲0.66s

首回答延遲9.23s

供應商價格排行

16 個供應商

最便宜: LLM Gateway最貴: Regolo AI

供應商輸入輸出

1LLM Gateway最便宜

$0.04

$0.15

2Clarifai

$0.045

$0.18

3Helicone

$0.05

$0.2

4OpenAI主要

$0.05

$0.2

5DigitalOcean

$0.05

$0.45

6OVHcloud AI Endpoints

$0.05

$0.18

7Databricks

$0.05

$0.2

8Neon

$0.05

$0.2

9Fireworks AI

$0.07

$0.3

10Amazon Bedrock

$0.07

$0.3

11FrogBot

$0.07

$0.2

12Vertex

$0.07

$0.25

13NanoGPT

$0.2

$0.8

14Cloudflare AI Gateway

$0.2

$0.3

15Cloudflare Workers AI

$0.2

$0.3

16Regolo AI

$0.4

$1.8

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis