gpt-oss-20B (high)

OpenAIオープンウエイトApache 2.0 · 商用利用可

説明

The gpt-oss-20b model (technically 20.9B parameters) achieves near-parity with OpenAI o4-mini on core reasoning benchmarks, while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it ideal for on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite) and HealthBench (even outperforming proprietary models like OpenAI o1 and GPT‑4o). Note: While referred to as '20b' for simplicity, it technically has 20.9B parameters.

リリース日

2025-08-05

パラメータ

20.9B

コンテキスト長

131K

モダリティ

text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Biology

GPQA

71.5%自己申告

Communication

TAU-bench Retail

54.8%自己申告

Finance

MMLU

85.3%自己申告

Healthcare

HealthBench

42.5%自己申告

HealthBench Hard

10.8%自己申告

Math

CodeForces

0.74 / 3000自己申告

Humanity's Last Exam

10.9%自己申告

AA評価指数

Math Index

89.3

Coding Index

20.7

Intelligence Index

14.9

Aime 25

0.9

Livecodebench

0.8

Mmlu Pro

0.7

Gpqa

0.7

Ifbench

0.7

Tau2

0.6

Scicode

0.3

Lcr

0.3

Terminalbench V2 1

0.1

Terminalbench Hard

0.1

Hle

0.1

Tau Banking

0.1

LLM Statsカテゴリスコア

Language

Legal

Finance

General

Physics

Biology

Chemistry

Math

Reasoning

Healthcare

Communication

Tool Calling

Vision

価格設定

入力価格$0.05 / 1Mトークン

出力価格$0.2 / 1Mトークン

混合価格（3:1）$0.088 / 1Mトークン

速度

トークン/秒233.2

初トークン遅延0.66s

初回答遅延9.23s

プロバイダー価格ランキング

16 プロバイダー

最安: LLM Gateway最高: Regolo AI

プロバイダー入力出力

1LLM Gateway最安

$0.04

$0.15

2Clarifai

$0.045

$0.18

3Helicone

$0.05

$0.2

4OpenAIプライマリ

$0.05

$0.2

5DigitalOcean

$0.05

$0.45

6OVHcloud AI Endpoints

$0.05

$0.18

7Databricks

$0.05

$0.2

8Neon

$0.05

$0.2

9Fireworks AI

$0.07

$0.3

10Amazon Bedrock

$0.07

$0.3

11FrogBot

$0.07

$0.2

12Vertex

$0.07

$0.25

13NanoGPT

$0.2

$0.8

14Cloudflare AI Gateway

$0.2

$0.3

15Cloudflare Workers AI

$0.2

$0.3

16Regolo AI

$0.4

$1.8

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
コーディングランキング	248	39.0	AA
総合ランキング	171	53.0	AA
数学的推論	39	90.0	AA
科学	201	48.0	AA