gpt-oss-120B (high)
OpenAI · Open Weight · Apache 2.0 · Commercial OK
Description
GPT-OSS-120B is an open-weight, 116.8B-parameter Mixture-of-Experts (MoE) language model from OpenAI, designed for reasoning-heavy, agentic, and general-purpose production use cases. It activates 5.1B parameters per forward pass and is optimized to run on a single H100 GPU with native MXFP4 quantization. The model supports configurable reasoning depth, full chain-of-thought access, and native tool use, including function calling, browsing, and structured output generation. It achieves near-parity with OpenAI o4-mini on core reasoning benchmarks. Note: although the name says "120B" for simplicity, the model has 116.8B parameters.
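To make the single-GPU claim concrete, here is a rough back-of-the-envelope sketch using the parameter counts from the description. It rests on two assumptions that are not in the source: that every weight is stored in MXFP4 at about 4.25 bits per parameter (4-bit elements plus one 8-bit scale shared by each block of 32), and that the H100 in question has 80 GB of memory. In practice some layers are typically kept at higher precision, and activations and KV cache need room too, so treat the result as a lower bound rather than a measured figure.

```python
# Figures taken from the description above.
TOTAL_PARAMS = 116.8e9
ACTIVE_PARAMS = 5.1e9

# Assumption (not from the source): MXFP4 uses 4-bit elements plus one
# 8-bit scale shared across a block of 32 elements, i.e. 4.25 bits/param.
MXFP4_BITS_PER_PARAM = 4 + 8 / 32

# Assumption (not from the source): an 80 GB H100 variant.
H100_MEMORY_GB = 80.0


def active_fraction(total_params: float, active_params: float) -> float:
    """Share of parameters used in a single forward pass."""
    return active_params / total_params


def weight_footprint_gb(params: float, bits_per_param: float) -> float:
    """Approximate weight storage in gigabytes (1 GB = 1e9 bytes)."""
    return params * bits_per_param / 8 / 1e9


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
    size = weight_footprint_gb(TOTAL_PARAMS, MXFP4_BITS_PER_PARAM)
    print(f"active parameters per pass: {frac:.1%}")
    print(f"estimated weight footprint: {size:.1f} GB")
    print(f"fits in assumed GPU memory: {size < H100_MEMORY_GB}")
```

Under these assumptions the weights alone come to roughly 62 GB, which is consistent with the single-H100 statement but leaves limited headroom for long contexts.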
Release date
2025-08-05
Parameters
116.8B
Context length
131K
Supported modalities
text
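Capability radar
general: 45
coding: 50
reasoning: 91
science (estimated): 53
agents: 70
multimodal: 0
Science is estimated from reasoning ability as a proxy when no dedicated science evaluation is available.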
Leaderboard rankings
Benchmark scores (LLM Stats)
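
| Category | Benchmark | Score (self-reported) |
|---|---|---|
| Biology | GPQA | 80.1% |
| Communication | TAU-bench Retail | 67.8% |
| Finance | MMLU | 90.0% |
| Healthcare | HealthBench | 57.6% |
| Healthcare | HealthBench Hard | 30.0% |
| Math | CodeForces | 0.82 / 3000 |
| Math | Humanity's Last Exam | 14.9% |
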
AA evaluation indices
Math Index: 93.4
Intelligence Index: 33.3
Coding Index: 28.6
AIME 25: 0.9
LiveCodeBench: 0.9
MMLU-Pro: 0.8
GPQA: 0.8
IFBench: 0.7
Tau2: 0.7
LCR: 0.5
SciCode: 0.4
Terminal-Bench Hard: 0.2
HLE: 0.2
LLM Stats category scores
Finance: 90
General: 90
Language: 90
Legal: 90
Biology: 80
Chemistry: 80
Physics: 80
Tool Calling: 70
Communication: 70
Reasoning: 70
Healthcare: 60
Math: 60
Vision: 10
Pricing
Input price: $0.15 / 1M tokens
Output price: $0.6 / 1M tokens
Blended price (3:1): $0.262 / 1M tokens
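The blended figure can be reproduced from the input and output prices listed above. The sketch below assumes that "3:1" means three input tokens for every output token, which is an interpretation rather than something the page states; under that reading the weighted average is $0.2625 per 1M tokens, in line with the listed $0.262. The function names are illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3, output_weight: float = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio is input tokens to output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float, output_price: float) -> float:
    """Cost in USD of one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    # Prices from the pricing section above (USD per 1M tokens).
    print(f"blended: ${blended_price(0.15, 0.6):.4f} / 1M tokens")
    # Hypothetical request: 3,000 input tokens and 1,000 output tokens.
    print(f"request: ${request_cost(3_000, 1_000, 0.15, 0.6):.6f}")
```

A workload with a different input-to-output mix, for example long reasoning outputs, would shift the effective price toward the output rate.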
Speed
Tokens per second: 251.0 tokens/s
Time to first token: 0.50s
Time to first answer: 8.47s
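The gap between the first token and the first answer is plausibly time spent emitting reasoning tokens before the visible answer begins. The sketch below works through that arithmetic with the three speed figures above. It assumes a steady generation rate and that the whole gap is reasoning output; neither assumption is confirmed by the page, so the numbers are estimates only.

```python
# Figures from the speed section above.
TOKENS_PER_SECOND = 251.0
TIME_TO_FIRST_TOKEN_S = 0.50
TIME_TO_FIRST_ANSWER_S = 8.47


def implied_reasoning_tokens(first_token_s: float, first_answer_s: float,
                             tokens_per_s: float) -> float:
    """Tokens generated between the first token and the first answer token.

    Assumption: generation runs at a constant rate during that gap.
    """
    return (first_answer_s - first_token_s) * tokens_per_s


def estimated_total_time_s(answer_tokens: int, first_answer_s: float,
                           tokens_per_s: float) -> float:
    """Rough wall-clock time until an answer of the given length is complete."""
    return first_answer_s + answer_tokens / tokens_per_s


if __name__ == "__main__":
    tokens = implied_reasoning_tokens(
        TIME_TO_FIRST_TOKEN_S, TIME_TO_FIRST_ANSWER_S, TOKENS_PER_SECOND)
    print(f"implied reasoning tokens: {tokens:.0f}")
    # Hypothetical answer length of 500 tokens.
    total = estimated_total_time_s(500, TIME_TO_FIRST_ANSWER_S, TOKENS_PER_SECOND)
    print(f"estimated time for a 500-token answer: {total:.2f}s")
```

With these figures the implied reasoning budget is about 2,000 tokens, which is worth keeping in mind when comparing this high-reasoning configuration against lower-effort settings on latency.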
Available providers
(Prices are in LS internal pricing units.)

| Provider | Input price | Output price |
|---|---|---|
| DeepInfra | 90K | 450K |
| OpenAI | 100K | 500K |
| Novita | 100K | 500K |
| Fireworks | 150K | 600K |
| Groq | 150K | 600K |