Skip to main content

DeepSeek V4 Flash (Non-reasoning)

DeepSeekDeepSeek

Description

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Release Date
2026-04-24
Parameters
Context Length
1.0M
Modalities
text

Capability Radar

24
general
37
coding
72
reasoning
47
scienceest.
60
agents
0
multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain#RankScoreSource
Agentic Capability52
56.0
LS
Code Ranking194
49.0
AA
General Ranking126
60.0
AA
Science191
49.0
AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA1203.00 / 3000SR
BrowseComp73.2%SR
MCP Atlas69.0%SR
Terminal-Bench 2.056.9%SR
SWE-Bench Pro52.6%SR
Toolathlon47.8%SR

Biology

GPQA88.1%SR

Code

LiveCodeBench91.6%SR
SWE-Bench Verified79.0%SR
SWE-bench Multilingual73.3%SR

Factuality

SimpleQA34.1%SR

Finance

MMLU-Pro86.2%SR

General

CSimpleQA78.9%SR
MRCR 1M78.7%SR
CorpusQA 1M60.5%SR

Math

CodeForces1.00 / 3000SR
HMMT Feb 2694.8%SR
IMO-AnswerBench88.4%SR
MathArena Apex85.7%SR
Humanity's Last Exam45.1%SR

AA Evaluation Indices

Intelligence Index
28.7
Tau2
0.9
Gpqa
0.7
Ifbench
0.5
Scicode
0.4
Terminalbench Hard
0.3
Lcr
0.3
Hle
0.1

LLM Stats Category Scores

Legal
100
Finance
100
Agents
100
General
100
Reasoning
68
Physics
90
Healthcare
90
Biology
90
Chemistry
90
Language
80
Long Context
80
Math
80
Frontend Development
80
Search
70
Code
70
Tool Calling
60
Vision
50
Factuality
30

Pricing

Input Price$0.14 / 1M tokens
Output Price$0.28 / 1M tokens
Blended Price (3:1)$0.175 / 1M tokens
Cache Read Price$0.0028 / 1M tokens

Speed

Tokens/sec120.2
Time to First Token1.07s
Time to Answer1.07s

Provider Price Ranking

Provider Price Ranking

11 providers

Cheapest: CrofAIMost Expensive: Azure
ProviderInputOutput
1CrofAICheapest
$0.12
$0.21
2Cortecs
$0.133
$0.266
3DeepSeekPRIMARY
$0.14
$0.28
4OpenCode Go
$0.14
$0.28
5Alibaba (China)
$0.14
$0.28
6OpenCode Zen
$0.14
$0.28
7Wafer
$0.14
$0.28
8LLM Gateway
$0.14
$0.28
9Auriko
$0.14
$0.28
10Venice AI
$0.17
$0.35
11Azure
$0.19
$0.51

Compare pricing across different API providers for this model.

External Sources