DeepSeek R1 Distill Qwen 1.5B

DeepSeekDeepSeek오픈 웨이트MIT · 상업적 사용 가능

설명

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.

출시일

2025-01-20

파라미터

1.8B

컨텍스트 길이

—

모달리티

—

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
코딩 랭킹	476	4.0	AA
종합 랭킹	511	9.0	AA
수학 추론	269	30.0	AA
과학	509	5.0	AA

벤치마크 점수 (LLM Stats)

Biology

GPQA

33.8%자체 보고

Code

LiveCodeBench

16.9%자체 보고

Math

MATH-500

83.9%자체 보고

AIME 2024

52.7%자체 보고

AA 평가 지수

Math Index

22.0

Intelligence Index

3.7

Math 500

0.7

Mmlu Pro

0.3

Aime 25

0.2

Aime

0.2

Ifbench

0.1

Gpqa

0.1

Livecodebench

0.1

Scicode

0.1

Hle

0.0

Lcr

0.0

LLM Stats 카테고리 점수

Math

Reasoning

Physics

General

Biology

Chemistry

Code

가격

입력 가격무료

출력 가격무료

혼합 가격 (3:1)무료

속도

토큰/초0.0

첫 토큰 지연0.00s

첫 응답 지연0.00s

공급자 가격 순위

프로바이더 데이터가 없습니다

외부 링크

LLM Stats Artificial Analysis