gpt-oss-20B (high)

OpenAI开源权重Apache 2.0 · 商用许可

描述

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

发布日期

2025-08-05

参数规模

20.9B

上下文长度

131K

支持模态

text

能力雷达图

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域	#排名	分数	来源
代码能力榜	248	39.0	AA
通用能力榜	171	53.0	AA
数学推理	39	90.0	AA
科学能力	201	48.0	AA

基准测试分数 (LLM Stats)

Biology

GPQA

71.5%自报

Communication

TAU-bench Retail

54.8%自报

Finance

MMLU

85.3%自报

Healthcare

HealthBench

42.5%自报

HealthBench Hard

10.8%自报

Math

CodeForces

0.74 / 3000自报

Humanity's Last Exam

10.9%自报

AA 评测指数

Math Index

89.3

Coding Index

20.7

Intelligence Index

14.9

Aime 25

0.9

Livecodebench

0.8

Mmlu Pro

0.7

Gpqa

0.7

Ifbench

0.7

Tau2

0.6

Scicode

0.3

Lcr

0.3

Terminalbench V2 1

0.1

Terminalbench Hard

0.1

Hle

0.1

Tau Banking

0.1

LLM Stats 分类评分

Language

Legal

Finance

General

Physics

Biology

Chemistry

Math

Reasoning

Healthcare

Communication

Tool Calling

Vision

定价

输入价格$0.05 / 1M tokens

输出价格$0.2 / 1M tokens

混合价格(3:1)$0.088 / 1M tokens

速度

Tokens/秒233.2

首Token延迟0.66s

首回答延迟9.23s

供应商价格排行

16 个供应商

最便宜: LLM Gateway最贵: Regolo AI

供应商输入输出

1LLM Gateway最便宜

$0.04

$0.15

2Clarifai

$0.045

$0.18

3Helicone

$0.05

$0.2

4OpenAI主要

$0.05

$0.2

5DigitalOcean

$0.05

$0.45

6OVHcloud AI Endpoints

$0.05

$0.18

7Databricks

$0.05

$0.2

8Neon

$0.05

$0.2

9Fireworks AI

$0.07

$0.3

10Amazon Bedrock

$0.07

$0.3

11FrogBot

$0.07

$0.2

12Vertex

$0.07

$0.25

13NanoGPT

$0.2

$0.8

14Cloudflare AI Gateway

$0.2

$0.3

15Cloudflare Workers AI

$0.2

$0.3

16Regolo AI

$0.4

$1.8

比较该模型在不同 API 供应商之间的定价。

外部链接

LLM Stats Artificial Analysis