跳转到主要内容

gpt-oss-120B (high)

OpenAIOpen WeightApache 2.0 · Commercial OK

描述

GPT-OSS-120B is an open-weight, 116.8B-parameter Mixture-of-Experts (MoE) language model from OpenAI designed for high-reasoning, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation. It achieves near-parity with OpenAI o4-mini on core reasoning benchmarks. Note: While referred to as '120b' for simplicity, it technically has 116.8B parameters.

发布日期
2025-08-05
参数规模
116.8B
上下文长度
131K
支持模态
text

能力雷达图

45
general
50
coding
91
reasoning
53
science估算
70
agents
0
multimodal

Science 在缺少专门科学评测时使用推理能力代理估算。

排行榜排名

领域#排名分数来源
代码能力榜100
60.0
AA
通用能力榜91
68.0
AA
数学推理22
94.0
AA
科学能力94
62.0
AA

基准测试分数 (LLM Stats)

Biology

GPQA80.1%自报

Communication

TAU-bench Retail67.8%自报

Finance

MMLU90.0%自报

Healthcare

HealthBench57.6%自报
HealthBench Hard30.0%自报

Math

CodeForces0.82 / 3000自报
Humanity's Last Exam14.9%自报

AA 评测指数

Math Index
93.4
Intelligence Index
33.3
Coding Index
28.6
Aime 25
0.9
Livecodebench
0.9
Mmlu Pro
0.8
Gpqa
0.8
Ifbench
0.7
Tau2
0.7
Lcr
0.5
Scicode
0.4
Terminalbench Hard
0.2
Hle
0.2

LLM Stats 分类评分

Finance
90
General
90
Language
90
Legal
90
Biology
80
Chemistry
80
Physics
80
Tool Calling
70
Communication
70
Reasoning
70
Healthcare
60
Math
60
Vision
10

定价

输入价格$0.15 / 1M tokens
输出价格$0.6 / 1M tokens
混合价格(3:1)$0.262 / 1M tokens

速度

Tokens/秒251.0 tokens/s
首Token延迟0.50s
首回答延迟8.47s

可用提供商

(LS 内部计价单位)
提供商输入价格输出价格
DeepInfra90K450K
OpenAI100K500K
Novita100K500K
Fireworks150K600K
Groq150K600K

外部链接