DeepSeek V4 Flash (Non-reasoning)

DeepSeekDeepSeek

Description

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Release Date

2026-04-24

Parameters

—

Context Length

1.0M

Modalities

text

Capability Radar

general

coding

reasoning

scienceest.

agents

multimodal

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain	#Rank	Score	Source
Agentic Capability	52	56.0	LS
Code Ranking	194	49.0	AA
General Ranking	126	60.0	AA
Science	191	49.0	AA

Benchmark Scores (LLM Stats)

Agents

GDPval-AA

1203.00 / 3000SR

BrowseComp

73.2%SR

MCP Atlas

69.0%SR

Terminal-Bench 2.0

56.9%SR

SWE-Bench Pro

52.6%SR

Toolathlon

47.8%SR

Biology

GPQA

88.1%SR

Code

LiveCodeBench

91.6%SR

SWE-Bench Verified

79.0%SR

SWE-bench Multilingual

73.3%SR

Factuality

SimpleQA

34.1%SR

Finance

MMLU-Pro

86.2%SR

General

CSimpleQA

78.9%SR

MRCR 1M

78.7%SR

CorpusQA 1M

60.5%SR

Math

CodeForces

1.00 / 3000SR

HMMT Feb 26

94.8%SR

IMO-AnswerBench

88.4%SR

MathArena Apex

85.7%SR

Humanity's Last Exam

45.1%SR

AA Evaluation Indices

Intelligence Index

28.7

Tau2

0.9

Gpqa

0.7

Ifbench

0.5

Scicode

0.4

Terminalbench Hard

0.3

Lcr

0.3

Hle

0.1

LLM Stats Category Scores

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Physics

Healthcare

Biology

Chemistry

Language

Long Context

Math

Frontend Development

Code

Tool Calling

Vision

Factuality

Pricing

Input Price$0.14 / 1M tokens

Output Price$0.28 / 1M tokens

Blended Price (3:1)$0.175 / 1M tokens

Cache Read Price$0.0028 / 1M tokens

Speed

Tokens/sec120.2

Time to First Token1.07s

Time to Answer1.07s

Provider Price Ranking

11 providers

Cheapest: CrofAIMost Expensive: Azure

ProviderInputOutput

1CrofAICheapest

$0.12

$0.21

2Cortecs

$0.133

$0.266

3DeepSeekPRIMARY

$0.14

$0.28

4OpenCode Go

$0.14

$0.28

5Alibaba (China)

$0.14

$0.28

6OpenCode Zen

$0.14

$0.28

7Wafer

$0.14

$0.28

8LLM Gateway

$0.14

$0.28

9Auriko

$0.14

$0.28

10Venice AI

$0.17

$0.35

11Azure

$0.19

$0.51

Compare pricing across different API providers for this model.

External Sources

Artificial Analysis