DeepSeek R1 Distill Qwen 7B

DeepSeek · Open weights · MIT · Commercial use permitted

Description

DeepSeek-R1 is the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). It incorporates large-scale reinforcement learning (RL) to enhance its chain-of-thought and reasoning capabilities, delivering strong performance in math, code, and multi-step reasoning tasks.

Release date

2025-01-20

Parameters

7.6B

Context length

—

Supported modalities

—

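Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal

Science is estimated using reasoning ability as a proxy when no dedicated science evaluation is available.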

Leaderboard ranking

No ranking data available

Benchmark scores (LLM Stats)

Category    Benchmark        Score
Biology     GPQA             49.1% (self-reported)
Code        LiveCodeBench    37.6% (self-reported)
Math        MATH-500         92.8% (self-reported)
Math        AIME 2024        83.3% (self-reported)

AA evaluation index

No AA evaluation data available

LLM Stats category scores

Categories: Math, Reasoning, Physics, Biology, Chemistry, General, Code

Pricing

No pricing data available

Speed

No speed data available

Provider price ranking

1 provider

Rank    Provider           Input     Output
1       Alibaba (China)    $0.072    $0.144

Compare this model's pricing across API providers.
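As a rough illustration of how the listed input and output prices combine into a per-request cost, here is a minimal sketch. The listing does not state the billing unit, so the code assumes USD per 1,000,000 tokens; that unit, the function name `estimate_cost`, and the constant names are all hypothetical and should be checked against the provider's own price sheet.

```python
# Listed prices for the single provider (Alibaba, China).
# ASSUMPTION: prices are USD per 1,000,000 tokens; the listing does not state the unit.
INPUT_PRICE = 0.072
OUTPUT_PRICE = 0.144
TOKENS_PER_UNIT = 1_000_000  # hypothetical billing unit


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request under the assumed unit."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE
    return total / TOKENS_PER_UNIT


if __name__ == "__main__":
    # Example: a short prompt followed by a long reasoning-style answer.
    print(f"${estimate_cost(2_000, 8_000):.6f}")
```

Because the output price is twice the input price, long chain-of-thought answers are likely to dominate the cost of a typical request under this assumption.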

External links

LLM Stats · Artificial Analysis