Skip to main content

DeepSeek V4 Flash (Reasoning, Max Effort)

DeepSeekDeepSeekOpen WeightMIT · Commercial OK

Description

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Release Date
2026-04-24
Parameters
284.0B
Context Length
1.0M
Modalities
text

Capability Radar

43
general
40
coding
89
reasoning
62
scienceest.
60
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Agents & Tools49
57.0
LS
Code Ranking64
68.0
AA
General Ranking24
85.0
AA
Science25
80.0
AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA1395.00 / 3000SR
BrowseComp73.2%SR
MCP Atlas69.0%SR
Terminal-Bench 2.056.9%SR
SWE-Bench Pro52.6%SR
Toolathlon47.8%SR

Biology

GPQA88.1%SR

Code

LiveCodeBench91.6%SR
SWE-Bench Verified79.0%SR
SWE-bench Multilingual73.3%SR

Factuality

SimpleQA34.1%SR

Finance

MMLU-Pro86.2%SR

General

CSimpleQA78.9%SR
MRCR 1M78.7%SR
CorpusQA 1M60.5%SR

Math

CodeForces1.00 / 3000SR
HMMT Feb 2694.8%SR
IMO-AnswerBench88.4%SR
MathArena Apex85.7%SR
Humanity's Last Exam45.1%SR

AA Evaluation Indices

Intelligence Index
46.5
Coding Index
38.7
Tau2
0.9
Gpqa
0.9
Ifbench
0.8
Lcr
0.6
Scicode
0.4
Terminalbench Hard
0.4
Hle
0.3

LLM Stats Category Scores

Finance
100
Legal
100
Agents
100
General
100
Reasoning
78
Biology
90
Chemistry
90
Healthcare
90
Physics
90
Frontend Development
80
Language
80
Long Context
80
Math
80
Code
70
Search
70
Tool Calling
60
Vision
50
Factuality
30

Pricing

Input Price$0.14 / 1M tokens
Output Price$0.28 / 1M tokens
Blended Price (3:1)$0.175 / 1M tokens

Speed

Tokens/sec74.3 tokens/s
Time to First Token0.82s
Time to Answer76.34s

Available Providers

(LS internal units)
ProviderInput PriceOutput Price
DeepSeek140K280K

External Sources