DeepSeek R1 Distill Qwen 7B

DeepSeek · Open weights · MIT · Commercial use permitted

Description

DeepSeek R1 Distill Qwen 7B is a distilled variant of DeepSeek-R1, the first-generation reasoning model built atop DeepSeek-V3 (671B total parameters, 37B activated per token). DeepSeek-R1 uses large-scale reinforcement learning (RL) to strengthen its chain-of-thought reasoning and performs strongly on math, code, and multi-step reasoning tasks.

Release date

2025-01-20

Parameters

7.6B

Context length

—

Supported modalities

—

Capability radar chart

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When no dedicated science benchmark is available, the science score is estimated using reasoning ability as a proxy.
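The fallback described above can be sketched in a few lines. This is only an illustration: the page does not say how the proxy is computed, so using the reasoning score unchanged is an assumption, and the function name `science_score` is hypothetical.

```python
from typing import Optional, Tuple


def science_score(
    science: Optional[float], reasoning: Optional[float]
) -> Tuple[Optional[float], bool]:
    """Return (score, is_estimated) for the science axis.

    Assumption: when no dedicated science result exists, the reasoning
    score is reused as-is. The real mapping is not given on the page.
    """
    if science is not None:
        # A dedicated science benchmark exists, so use it directly.
        return science, False
    if reasoning is not None:
        # No science benchmark: fall back to reasoning and flag it.
        return reasoning, True
    # Neither score is available, so nothing can be shown.
    return None, False
```

The boolean flag corresponds to the "estimated" marker shown next to the science axis, so a display layer can tell a measured value from a proxy.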

Leaderboard ranking

No ranking data available.

Benchmark scores (LLM Stats)

Biology
GPQA: 49.1% (self-reported)

Code
LiveCodeBench: 37.6% (self-reported)

Math
MATH-500: 92.8% (self-reported)
AIME 2024: 83.3% (self-reported)

AA evaluation index

No AA evaluation data available.

LLM Stats category scores

Categories: Math, Reasoning, Physics, Biology, Chemistry, General, Code

Pricing

No pricing data available.

Speed

No speed data available.

Provider price ranking

1 provider

Rank  Provider         Input    Output
1     Alibaba (China)  $0.072   $0.144

Compare this model's pricing across API providers.
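As a rough illustration of how the listed input and output prices combine into a per-request cost, here is a minimal sketch. The page does not state the billing unit, so the per-1,000,000-token unit below is an assumption, and `request_cost` is a hypothetical helper rather than any provider's API.

```python
def request_cost(
    input_tokens: int,
    output_tokens: int,
    input_price: float = 0.072,   # USD, listed input price for Alibaba (China)
    output_price: float = 0.144,  # USD, listed output price for Alibaba (China)
    tokens_per_unit: int = 1_000_000,  # ASSUMPTION: prices are per 1M tokens
) -> float:
    """Estimate the USD cost of one request from its token counts."""
    total = input_tokens * input_price + output_tokens * output_price
    return total / tokens_per_unit


# Example: a 2,000-token prompt with an 8,000-token reasoning-heavy answer.
cost = request_cost(2_000, 8_000)
print(f"{cost:.6f} USD")
```

Because the output price is twice the input price, long chain-of-thought responses dominate the bill under this assumption. If the provider bills per a different unit, pass a different `tokens_per_unit`.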

External links

LLM Stats · Artificial Analysis