gpt-oss-120B (high)
OpenAI · Open Weight · Apache 2.0 · Commercial OK
Description
GPT-OSS-120B is an open-weight Mixture-of-Experts (MoE) language model from OpenAI, built for reasoning-heavy, agentic, and general-purpose production workloads. It has 116.8B total parameters (the "120B" in the name is a rounded figure), of which 5.1B are activated per forward pass, and it is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation. It achieves near-parity with OpenAI o4-mini on core reasoning benchmarks.
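To make the sparsity of the MoE design concrete, the short sketch below computes the share of parameters that are active per forward pass, using only the two figures quoted in the description. The variable names are illustrative and not part of any API.

```python
# Figures taken from the description above (in billions of parameters).
TOTAL_PARAMS_B = 116.8   # total parameters
ACTIVE_PARAMS_B = 5.1    # parameters activated per forward pass

# Fraction of the model that participates in a single forward pass.
active_fraction = ACTIVE_PARAMS_B / TOTAL_PARAMS_B

print(f"Active share: {active_fraction:.1%}")  # about 4.4%
```

Roughly one parameter in twenty-three is used per forward pass, which is why per-token compute is far below what the total parameter count suggests.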
Release Date
2025-08-05
Parameters
116.8B
Context Length
131K
Modalities
text
Capability Radar
general: 45
coding: 50
reasoning: 91
science (est.): 53
agents: 70
multimodal: 0
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | Rank | Score | Source |
|---|---|---|---|
| Code Ranking | 100 | 60.0 | AA |
| General Ranking | 91 | 68.0 | AA |
| Math Reasoning | 22 | 94.0 | AA |
| Science | 94 | 62.0 | AA |
Benchmark Scores (LLM Stats)
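| Category | Benchmark | Score |
|---|---|---|
| Biology | GPQA | 80.1% (SR) |
| Communication | TAU-bench Retail | 67.8% (SR) |
| Finance | MMLU | 90.0% (SR) |
| Healthcare | HealthBench | 57.6% (SR) |
| Healthcare | HealthBench Hard | 30.0% (SR) |
| Math | CodeForces | 0.82 / 3000 (SR) |
| Math | Humanity's Last Exam | 14.9% (SR) |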
AA Evaluation Indices
Math Index: 93.4
Intelligence Index: 33.3
Coding Index: 28.6
AIME 25: 0.9
LiveCodeBench: 0.9
MMLU Pro: 0.8
GPQA: 0.8
IFBench: 0.7
Tau2: 0.7
LCR: 0.5
SciCode: 0.4
TerminalBench Hard: 0.2
HLE: 0.2
LLM Stats Category Scores
Finance: 90
General: 90
Language: 90
Legal: 90
Biology: 80
Chemistry: 80
Physics: 80
Tool Calling: 70
Communication: 70
Reasoning: 70
Healthcare: 60
Math: 60
Vision: 10
Pricing
Input Price: $0.15 / 1M tokens
Output Price: $0.60 / 1M tokens
Blended Price (3:1): $0.262 / 1M tokens
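The blended figure can be reproduced as a weighted average of the two prices. The sketch below assumes that "3:1" means three input tokens for every output token; the page does not define the ratio, so treat that reading as an assumption. The function name `blended_price` is illustrative.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes the ratio is input:output (3 input tokens per 1 output token).
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices from the Pricing section, in USD per 1M tokens.
price = blended_price(0.15, 0.60)
print(f"${price:.4f} / 1M tokens")
```

Under that assumption the result is (3 × 0.15 + 0.60) / 4 = 0.2625, which agrees with the listed $0.262 up to rounding.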
Speed
Tokens/sec: 251.0 tokens/s
Time to First Token: 0.50s
Time to Answer: 8.47s
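For rough capacity planning, the throughput and time-to-first-token figures can be combined into a simple latency estimate. The sketch below uses a deliberately simplified model (constant throughput after the first token) and a hypothetical helper name, `estimate_latency_s`. It does not attempt to reproduce the listed Time to Answer, whose definition is not given on this page.

```python
def estimate_latency_s(output_tokens: int,
                       ttft_s: float = 0.50,
                       tokens_per_s: float = 251.0) -> float:
    """Estimate end-to-end latency as TTFT plus generation time.

    Simplifying assumption: tokens stream at a constant rate once the
    first token arrives. Reasoning tokens, if generated, count toward
    output_tokens.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return ttft_s + output_tokens / tokens_per_s


# Example: estimated latency for several response lengths.
for n in (100, 1000, 2000):
    print(f"{n:>5} tokens -> {estimate_latency_s(n):.2f} s")
```

Real latency varies with provider, load, and prompt length, so treat the output as an order-of-magnitude guide only.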
Available Providers
Prices below are in LS internal units.
| Provider | Input Price | Output Price |
|---|---|---|
| DeepInfra | 90K | 450K |
| OpenAI | 100K | 500K |
| Novita | 100K | 500K |
| Fireworks | 150K | 600K |
| Groq | 150K | 600K |