DeepSeek R1 Distill Qwen 32B

DeepSeekDeepSeekOpen WeightMIT · Commercial OK

Description

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.

Release Date

2025-01-20

Parameters

32.8B

Context Length

—

Modalities

text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Code Ranking	361	21.0	AA
General Ranking	336	33.0	AA
Math Reasoning	113	72.0	AA
Science	253	44.0	AA

Benchmark Scores (LLM Stats)

Biology

GPQA

62.1%SR

Code

LiveCodeBench

57.2%SR

Math

MATH-500

94.3%SR

AIME 2024

83.3%SR

AA Evaluation Indices

Math Index

63.0

Intelligence Index

11.0

Math 500

0.9

Mmlu Pro

0.7

Aime

0.7

Aime 25

0.6

Gpqa

0.6

Scicode

0.4

Livecodebench

0.3

Ifbench

0.2

Lcr

0.1

Hle

0.1

LLM Stats Category Scores

Math

Reasoning

Physics

General

Biology

Chemistry

Code

Pricing

Input PriceFree

Output PriceFree

Blended Price (3:1)Free

Speed

Tokens/sec0.0

Time to First Token0.00s

Time to Answer0.00s

Provider Price Ranking

8 providers

Cheapest: SiliconFlow (China)Most Expensive: NanoGPT

ProviderInputOutput

1SiliconFlow (China)Cheapest

$0.18

2SiliconFlow

$0.18

3Alibaba (China)

$0.287

$0.861

4Kilo Gateway

$0.29

5NovitaAI

$0.3

6Cloudflare Workers AI

$0.497

$4.881

7Cloudflare AI Gateway

$0.5

$4.88

8NanoGPT

$1.4

Compare pricing across different API providers for this model.

External Sources

LLM Stats Artificial Analysis