
gpt-oss-20B (high)

OpenAI · Open Weight · Apache 2.0 · Commercial OK

Description

The gpt-oss-120b model achieves near-parity with OpenAI o4-mini on core reasoning benchmarks while running efficiently on a single 80 GB GPU. The gpt-oss-20b model delivers similar results to OpenAI o3‑mini on common benchmarks and can run on edge devices with just 16 GB of memory, making it well suited to on-device use cases, local inference, or rapid iteration without costly infrastructure. Both models also perform strongly on tool use, few-shot function calling, CoT reasoning (as seen in results on the Tau-Bench agentic evaluation suite), and HealthBench, where they outperform even proprietary models such as OpenAI o1 and GPT‑4o. Note: although it is referred to as "20b" for simplicity, the model has 20.9B parameters.

Release date: 2025-08-05
Parameters: 20.9B
Context length: 131K
Modality: text

Capability radar

general: 37
coding: 41
reasoning: 86
science (estimated): 45
agents: 50
multimodal: 0

When no dedicated science benchmark is available, the Science score is estimated from a reasoning proxy.

Rankings

Domain            Rank   Score   Source
Code Ranking      #196   41.0    AA
General Ranking   #147   58.0    AA
Math Reasoning    #39    90.0    AA
Science           #183   49.0    AA

Benchmark scores (LLM Stats)

Biology

GPQA: 71.5% (self-reported)

Communication

TAU-bench Retail: 54.8% (self-reported)

Finance

MMLU: 85.3% (self-reported)

Healthcare

HealthBench: 42.5% (self-reported)
HealthBench Hard: 10.8% (self-reported)

Math

CodeForces: 0.74 / 3000 (self-reported)
Humanity's Last Exam: 10.9% (self-reported)

AA evaluation indices

Math Index: 89.3
Intelligence Index: 24.5
Coding Index: 18.5
AIME 25: 0.9
LiveCodeBench: 0.8
MMLU-Pro: 0.7
GPQA: 0.7
IFBench: 0.7
Tau2: 0.6
SciCode: 0.3
LCR: 0.3
Terminal-Bench Hard: 0.1
HLE: 0.1

LLM Stats category scores

Finance: 90
Language: 90
Legal: 90
General: 80
Biology: 70
Chemistry: 70
Physics: 70
Math: 60
Reasoning: 60
Tool Calling: 50
Communication: 50
Healthcare: 50
Vision: 10

Pricing

Input price: $0.05 / 1M tokens
Output price: $0.20 / 1M tokens
Blended price (3:1): $0.088 / 1M tokens
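
The 3:1 blended figure appears consistent with a weighted average of three parts input price to one part output price. The sketch below reproduces that arithmetic from the two listed prices; the weighting scheme is an assumption inferred from the "3:1" label, and `blended_price` is a hypothetical helper, not part of any provider API.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average price per 1M tokens (assumed 3:1 input:output mix)."""
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices in USD per 1M tokens.
price = blended_price(0.05, 0.20)
print(f"${price:.4f} / 1M tokens")  # (3 * 0.05 + 1 * 0.20) / 4
```

Under this assumption the result is $0.0875, which rounds to the $0.088 shown above.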

Speed

Tokens per second: 282.4 tokens/s
Time to first token: 0.36 s
Time to first response: 7.44 s
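
As a rough illustration of how the throughput and first-token figures combine, the sketch below estimates the wall-clock time for a response of a given length. It assumes constant throughput after the first token and ignores network and queueing effects; `estimated_response_seconds` is a hypothetical helper. The page does not state how the 7.44 s figure is measured, so it is not used here.

```python
def estimated_response_seconds(output_tokens: int,
                               tokens_per_second: float = 282.4,
                               first_token_latency: float = 0.36) -> float:
    """Estimate total generation time: first-token latency plus streaming time.

    Assumes a constant token rate, which real deployments only approximate.
    """
    if output_tokens < 0:
        raise ValueError("output_tokens must be non-negative")
    return first_token_latency + output_tokens / tokens_per_second


# Example: a 1,000-token completion at the listed speed.
seconds = estimated_response_seconds(1000)
print(f"{seconds:.2f} s")
```

Actual latency varies by provider and load, so the estimate is best treated as a lower bound for planning.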

Available providers

(LS internal units)
Provider    Input price   Output price
OpenAI      100K          500K
Fireworks   100K          500K
Groq        100K          500K

External links