DeepSeek V4 Flash (Reasoning, Max Effort)
DeepSeekDeepSeekOpen WeightMIT · Commercial OK
描述
DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.
發布日期
2026-04-24
參數規模
284.0B
上下文長度
1.0M
支援模態
text
能力雷達圖
43
general
40
coding
89
reasoning
62
science估算
60
agents
0
multimodal
Science 在缺少專門科學評測時使用推理能力代理估算。
排行榜排名
基準測試分數 (LLM Stats)
Agents
GDPval-AA
1395.00 / 3000自報
BrowseComp
73.2%自報
MCP Atlas
69.0%自報
Terminal-Bench 2.0
56.9%自報
SWE-Bench Pro
52.6%自報
Toolathlon
47.8%自報
Biology
GPQA
88.1%自報
Code
LiveCodeBench
91.6%自報
SWE-Bench Verified
79.0%自報
SWE-bench Multilingual
73.3%自報
Factuality
SimpleQA
34.1%自報
Finance
MMLU-Pro
86.2%自報
General
CSimpleQA
78.9%自報
MRCR 1M
78.7%自報
CorpusQA 1M
60.5%自報
Math
CodeForces
1.00 / 3000自報
HMMT Feb 26
94.8%自報
IMO-AnswerBench
88.4%自報
MathArena Apex
85.7%自報
Humanity's Last Exam
45.1%自報
AA 評測指數
Intelligence Index46.5
Coding Index38.7
Tau20.9
Gpqa0.9
Ifbench0.8
Lcr0.6
Scicode0.4
Terminalbench Hard0.4
Hle0.3
LLM Stats 分類評分
Finance100
Legal100
Agents100
General100
Reasoning78
Biology90
Chemistry90
Healthcare90
Physics90
Frontend Development80
Language80
Long Context80
Math80
Code70
Search70
Tool Calling60
Vision50
Factuality30
定價
輸入價格$0.14 / 1M tokens
輸出價格$0.28 / 1M tokens
混合價格(3:1)$0.175 / 1M tokens
速度
Tokens/秒74.3 tokens/s
首Token延遲0.82s
首回答延遲76.34s
可用提供商
(LS 內部計價單位)| 提供商 | 輸入價格 | 輸出價格 |
|---|---|---|
| DeepSeek | 140K | 280K |