Skip to main content

DeepSeek V4 Flash (Reasoning, Max Effort)

DeepSeekDeepSeekOpen WeightMIT · Commercial OK

Description

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Release Date
2026-04-24
Parameters
284.0B
Context Length
1.0M
Modalities
text

Capability Radar

39
general
54
coding
89
reasoning
62
scienceest.
60
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Code Ranking68
72.0
AA
General Ranking20
81.0
AA
Science32
76.0
AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA1203.00 / 3000SR
BrowseComp73.2%SR
MCP Atlas69.0%SR
Terminal-Bench 2.056.9%SR
SWE-Bench Pro52.6%SR
Toolathlon47.8%SR

Biology

GPQA88.1%SR

Code

LiveCodeBench91.6%SR
SWE-Bench Verified79.0%SR
SWE-bench Multilingual73.3%SR

Factuality

SimpleQA34.1%SR

Finance

MMLU-Pro86.2%SR

General

CSimpleQA78.9%SR
MRCR 1M78.7%SR
CorpusQA 1M60.5%SR

Math

CodeForces1.00 / 3000SR
HMMT Feb 2694.8%SR
IMO-AnswerBench88.4%SR
MathArena Apex85.7%SR
Humanity's Last Exam45.1%SR

AA Evaluation Indices

Coding Index
56.2
Intelligence Index
40.3
Tau2
1.0
Gpqa
0.9
Ifbench
0.8
Lcr
0.6
Terminalbench V2 1
0.6
Scicode
0.4
Terminalbench Hard
0.4
Hle
0.3
Tau Banking
0.2

LLM Stats Category Scores

Legal
100
Finance
100
Agents
100
General
100
Reasoning
68
Physics
90
Healthcare
90
Biology
90
Chemistry
90
Language
80
Long Context
80
Math
80
Frontend Development
80
Search
70
Code
70
Tool Calling
60
Vision
50
Factuality
30

Pricing

Input Price$0.14 / 1M tokens
Output Price$0.28 / 1M tokens
Blended Price (3:1)$0.175 / 1M tokens
Cache Read Price$0.0028 / 1M tokens

Speed

Tokens/sec116.1
Time to First Token1.05s
Time to Answer49.40s

Provider Price Ranking

Provider Price Ranking

4 providers

Cheapest: DeepSeekMost Expensive: routing.run
ProviderInputOutput
1DeepSeekCheapest
$0
$0
2Poe
$0.14
$0.28
3AIHubMix
$0.14
$0.28
4routing.run
$0.4928
$0.7392

Compare pricing across different API providers for this model.

External Sources