gpt-oss-120b (high)

OpenAI · Open Weight · Apache 2.0 · Commercial OK

Description

GPT-OSS-120B is an open-weight Mixture-of-Experts (MoE) language model from OpenAI, designed for reasoning-heavy, agentic, and general-purpose production use cases. Despite the "120b" in its name, it has 116.8B total parameters, of which 5.1B are activated per forward pass. It is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation. It achieves near-parity with OpenAI o4-mini on core reasoning benchmarks.
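The parameter counts and the single-H100 claim can be sanity-checked with simple arithmetic. The sketch below is a rough estimate, not an official sizing: it assumes every weight is stored in MXFP4 (4-bit elements plus one shared 8-bit scale per block of 32, so 4.25 bits per weight) and assumes an 80 GB H100. Layers kept in higher precision, the KV cache, and activations are ignored, so real memory use will differ.

```python
# Back-of-the-envelope sizing for the figures quoted in the description.
TOTAL_PARAMS = 116.8e9   # total parameters (from the description)
ACTIVE_PARAMS = 5.1e9    # parameters activated per forward pass (from the description)

# Assumption: MXFP4 = 4-bit elements + one 8-bit scale shared by 32 elements.
MXFP4_BITS_PER_WEIGHT = 4 + 8 / 32  # 4.25 bits

# Assumption: the H100 variant with 80 GB of memory.
H100_MEMORY_GB = 80.0


def active_fraction(total_params: float, active_params: float) -> float:
    """Share of the model's parameters used in one forward pass."""
    return active_params / total_params


def weight_memory_gb(params: float, bits_per_weight: float) -> float:
    """Approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    mem = weight_memory_gb(TOTAL_PARAMS, MXFP4_BITS_PER_WEIGHT)
    print(f"active share of parameters: {frac:.2%}")
    print(f"estimated weight memory:    {mem:.2f} GB")
    print(f"fits in {H100_MEMORY_GB:.0f} GB (weights only): {mem < H100_MEMORY_GB}")
```

Under these assumptions the weights come to roughly 62 GB, which leaves some headroom on an 80 GB card, and only about 4.4% of the parameters take part in each forward pass.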

Release Date

2025-08-05

Parameters

116.8B

Context Length

131K

Modalities

text

Capability Radar

Axes: general, coding, reasoning, science (est.), agents, multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	146	56.0	AA
General Ranking	103	63.0	AA
Math Reasoning	22	94.0	AA
Science	108	60.0	AA

Benchmark Scores (LLM Stats)

Category	Benchmark	Score	Note
Biology	GPQA	80.1%	SR
Communication	TAU-bench Retail	67.8%	SR
Finance	MMLU	90.0%	SR
Healthcare	HealthBench	57.6%	SR
Healthcare	HealthBench Hard	30.0%	SR
Math	CodeForces	0.82 / 3000	SR
Math	Humanity's Last Exam	14.9%	SR

AA Evaluation Indices

Metric	Value
Math Index	93.4
Coding Index	30.4
Intelligence Index	23.8
AIME 25	0.9
LiveCodeBench	0.9
MMLU-Pro	0.8
GPQA	0.8
IFBench	0.7
Tau2	0.7
LCR	0.5
SciCode	0.4
Terminalbench V2 1	0.3
Terminalbench Hard	0.2
HLE	0.2
Tau Banking	0.1

LLM Stats Category Scores

Categories: Language, Legal, Finance, General, Physics, Biology, Chemistry, Reasoning, Communication, Tool Calling, Math, Healthcare, Vision

Pricing

Input Price: $0.15 / 1M tokens

Output Price: $0.60 / 1M tokens

Blended Price (3:1): $0.262 / 1M tokens
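The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token, so the blended price is a weighted average; the function name `blended_price` is illustrative and not part of any provider API.

```python
def blended_price(
    input_price: float,
    output_price: float,
    input_weight: float = 3.0,
    output_weight: float = 1.0,
) -> float:
    """Weighted-average price per 1M tokens for a given input:output token mix."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


if __name__ == "__main__":
    # Prices per 1M tokens as listed in the Pricing section.
    price = blended_price(input_price=0.15, output_price=0.60)
    print(f"blended price (3:1): ${price:.4f} / 1M tokens")
```

With the listed prices this gives (3 × 0.15 + 0.60) / 4 = $0.2625 per 1M tokens, which agrees with the listed $0.262 up to rounding.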

Speed

Tokens/sec: 347.4

Time to First Token: 0.54s

Time to Answer: 6.29s
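The throughput and latency figures can be combined into a rough estimate of how long a response of a given length takes. This is a minimal sketch under a simple linear model (time to first token plus output tokens divided by throughput). The listed "Time to Answer" may be measured differently, for example by including reasoning tokens, so the estimate is not expected to match it.

```python
TOKENS_PER_SEC = 347.4        # output speed listed above
TIME_TO_FIRST_TOKEN_S = 0.54  # latency listed above


def estimate_response_time(
    output_tokens: int,
    ttft_s: float = TIME_TO_FIRST_TOKEN_S,
    tokens_per_sec: float = TOKENS_PER_SEC,
) -> float:
    """Estimated seconds to stream `output_tokens` tokens under a linear model."""
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_s + output_tokens / tokens_per_sec


if __name__ == "__main__":
    for n in (100, 500, 2000):
        print(f"{n:>5} tokens: ~{estimate_response_time(n):.2f}s")
```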

Provider Price Ranking

23 providers

Cheapest: DeepInfra · Most Expensive: Regolo AI

Rank	Provider	Input (per 1M tokens)	Output (per 1M tokens)
1	DeepInfra (Cheapest)
2	OpenAI
3	Novita
4	Fireworks
5	Groq
6	Helicone	$0.04	$0.16
7	LLM Gateway	$0.05	$0.25
8	DInference	$0.0675	$0.27
9	Venice AI	$0.07	$0.30
10	Databricks	$0.072	$0.28
11	Neon	$0.072	$0.28
12	Chutes	$0.09	$0.36
13	OVHcloud AI Endpoints	$0.09	$0.47
14	Vertex	$0.09	$0.36
15	Clarifai	$0.09	$0.36
16	Nebius Token Factory	$0.10	$0.50
17	DigitalOcean	$0.10	$0.70
18	Scaleway	$0.15	$0.60
19	Amazon Bedrock	$0.15	$0.60
20	FrogBot	$0.15	$0.60
21	Poe	$0.35	$0.75
22	Cerebras	$0.35	$0.75
23	Regolo AI	$4.2 (only one price shown)

Compare pricing across different API providers for this model.
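Because input and output prices differ by provider, the cheapest option depends on the token mix of the workload. The sketch below ranks a handful of providers from the listing above for a hypothetical workload; the token volumes are made-up illustration values, the prices are assumed to be in dollars per 1M tokens, and the helper names are not part of any provider API.

```python
from typing import Dict, List, Tuple

# (input price, output price) in $ per 1M tokens, copied from the listing above.
PRICES: Dict[str, Tuple[float, float]] = {
    "Helicone": (0.04, 0.16),
    "LLM Gateway": (0.05, 0.25),
    "Nebius Token Factory": (0.10, 0.50),
    "DigitalOcean": (0.10, 0.70),
    "Cerebras": (0.35, 0.75),
}


def workload_cost(
    input_price: float, output_price: float, input_tokens: int, output_tokens: int
) -> float:
    """Dollar cost of a workload given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


def rank_providers(
    prices: Dict[str, Tuple[float, float]], input_tokens: int, output_tokens: int
) -> List[Tuple[str, float]]:
    """Return (provider, cost) pairs sorted from cheapest to most expensive."""
    costs = [
        (name, workload_cost(p_in, p_out, input_tokens, output_tokens))
        for name, (p_in, p_out) in prices.items()
    ]
    return sorted(costs, key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical workload: 10M input tokens and 2M output tokens.
    for name, cost in rank_providers(PRICES, 10_000_000, 2_000_000):
        print(f"{name:<22} ${cost:.2f}")
```

Providers with the same input price, such as Nebius Token Factory and DigitalOcean here, separate only on output price, so output-heavy workloads widen the gap between them.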

External Sources

LLM Stats · Artificial Analysis