DeepSeek R1 Distill Qwen 7B
DeepSeek · Open Weight · MIT · Commercial OK
Description
DeepSeek-R1-Distill-Qwen-7B is a dense 7.6B-parameter model distilled from DeepSeek-R1. DeepSeek-R1 itself is a first-generation reasoning model built on DeepSeek-V3 (671B total parameters, 37B activated per token). It uses large-scale reinforcement learning (RL) to strengthen its chain-of-thought reasoning and performs strongly on math, code, and multi-step reasoning tasks.
Release Date
2025-01-20
Parameters
7.6B
Context Length
—
Modalities
—
Capability Radar
general: 40
coding: 40
reasoning: 90
science (est.): 43
agents: 0
multimodal: 0
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
No ranking data available
Benchmark Scores (LLM Stats)
Biology
GPQA: 49.1% (SR)
Code
LiveCodeBench: 37.6% (SR)
Math
MATH-500: 92.8% (SR)
AIME 2024: 83.3% (SR)
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Math: 90
Reasoning: 70
Biology: 50
Chemistry: 50
Physics: 50
Code: 40
General: 40
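The page does not say how category scores are derived from benchmark scores. As a rough sketch only, one reading that happens to match the Math, Biology, and Code values listed here is a per-category mean rounded to the nearest 10. The grouping, the rounding rule, and the names `BENCHMARKS`, `round_to_nearest_ten`, and `category_scores` are assumptions made for illustration, not the site's documented method.

```python
import math
from statistics import mean

# Benchmark scores copied from this page, grouped by the category
# heading they appear under.
BENCHMARKS = {
    "Biology": {"GPQA": 49.1},
    "Code": {"LiveCodeBench": 37.6},
    "Math": {"MATH-500": 92.8, "AIME 2024": 83.3},
}


def round_to_nearest_ten(value: float) -> int:
    """Round a score to the nearest multiple of 10 (halves round up)."""
    return int(math.floor(value / 10 + 0.5)) * 10


def category_scores(benchmarks: dict[str, dict[str, float]]) -> dict[str, int]:
    """Hypothetical aggregation: average each category, then round to 10s.

    This is an assumed rule, not a documented one.
    """
    return {
        category: round_to_nearest_ten(mean(scores.values()))
        for category, scores in benchmarks.items()
    }


if __name__ == "__main__":
    for category, score in category_scores(BENCHMARKS).items():
        print(f"{category}: {score}")
```

Under this assumed rule the sketch yields Biology 50, Code 40, and Math 90. The Reasoning, Chemistry, Physics, and General values cannot be reproduced this way from the four benchmarks shown, so the actual method likely uses inputs or proxies not listed on this page.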
Pricing
No pricing data available
Speed
No speed data available
Available Providers
(LS internal units)
No provider data available