DeepSeek R1 0528 (May '25)
DeepSeekDeepSeekOpen WeightMIT · Commercial OK
描述
DeepSeek-R1-0528 is the May 28, 2025 version of DeepSeek's reasoning model. It features advanced thinking capabilities and serves as a benchmark comparison for newer models like DeepSeek-V3.1. This model excels in complex reasoning tasks, mathematical problem-solving, and code generation through its thinking mode approach.
發布日期
2025-05-28
參數規模
671.0B
上下文長度
164K
支援模態
text
能力雷達圖
43
general
44
coding
83
reasoning
54
science估算
10
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Agents
BrowseComp
8.9%自報
Terminal-Bench
5.7%自報
Biology
GPQA
81.0%自報
Code
LiveCodeBench
73.3%自報
Aider-Polyglot
71.6%自報
SWE-Bench Verified
44.6%自報
SWE-bench Multilingual
30.5%自報
Factuality
SimpleQA
92.3%自報
Finance
MMLU-Pro
85.0%自報
General
MMLU-Redux
93.4%自報
Math
AIME 2024
91.4%自報
AIME 2025
87.5%自報
HMMT 2025
79.4%自報
CodeForces
0.64 / 3000自報
Humanity's Last Exam
17.7%自報
Reasoning
BrowseComp-zh
35.7%自報
AA 評測指數
Math Index76.0
Intelligence Index27.1
Coding Index24.0
Math 5001.0
Aime0.9
Mmlu Pro0.8
Gpqa0.8
Livecodebench0.8
Aime 250.8
Lcr0.5
Scicode0.4
Ifbench0.4
Tau20.4
Terminalbench Hard0.2
Hle0.1
LLM Stats 分類評分
Factuality90
Language90
Biology80
Chemistry80
Finance80
General80
Healthcare80
Legal80
Physics80
Math70
Reasoning60
Code50
Frontend Development40
Vision20
Search20
Agents10
定價
輸入價格$1.35 / 1M tokens
輸出價格$4.2 / 1M tokens
混合價格(3:1)$2.063 / 1M tokens
速度
Tokens/秒0.0 tokens/s
首Token延遲0.00s
首回答延遲0.00s
可用提供商
(LS 內部計價單位)| 提供商 | 輸入價格 | 輸出價格 |
|---|---|---|
| DeepSeek | 550K | 2.2M |
| Novita | 700K | 2.5M |