DeepSeek R1 Distill Qwen 7B
DeepSeekDeepSeekOpen WeightMIT · Commercial OK
Description
DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.
Release Date
2025-01-20
Parameters
7.6B
Context Length
—
Modalities
—
Capability Radar
40
general
40
coding
90
reasoning
43
scienceest.
75
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
No ranking data available
Benchmark Scores (LLM Stats)
Biology
GPQA
49.1%SR
Code
LiveCodeBench
37.6%SR
Math
MATH-500
92.8%SR
AIME 2024
83.3%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Math90
Reasoning70
Physics50
Biology50
Chemistry50
General40
Code40
Pricing
No pricing data available
Speed
No speed data available
Provider Price Ranking
Provider Price Ranking
1 providers
ProviderInputOutput
1Alibaba (China)
$0.072
$0.144
Compare pricing across different API providers for this model.