DeepSeek V4 Flash (Reasoning, High Effort)

DeepSeekDeepSeek

Description

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Date de sortie

2026-04-24

Paramètres

—

Longueur du contexte

1.0M

Modalités

text

Radar de capacités

general

coding

reasoning

scienceest.

agents

multimodal

Science utilise un proxy de raisonnement lorsque les benchmarks scientifiques dédiés ne sont pas disponibles.

Classements

Domaine	#Rang	Score	Source
Classement codage	72	72.0	AA
Classement général	40	77.0	AA
Science	46	71.0	AA

Scores de benchmarks (LLM Stats)

Agents

GDPval-AA

1203.00 / 3000Aut.

BrowseComp

73.2%Aut.

MCP Atlas

69.0%Aut.

Terminal-Bench 2.0

56.9%Aut.

SWE-Bench Pro

52.6%Aut.

Toolathlon

47.8%Aut.

Biology

GPQA

88.1%Aut.

Code

LiveCodeBench

91.6%Aut.

SWE-Bench Verified

79.0%Aut.

SWE-bench Multilingual

73.3%Aut.

Factuality

SimpleQA

34.1%Aut.

Finance

MMLU-Pro

86.2%Aut.

General

CSimpleQA

78.9%Aut.

MRCR 1M

78.7%Aut.

CorpusQA 1M

60.5%Aut.

Math

CodeForces

1.00 / 3000Aut.

HMMT Feb 26

94.8%Aut.

IMO-AnswerBench

88.4%Aut.

MathArena Apex

85.7%Aut.

Humanity's Last Exam

45.1%Aut.

Indices d'évaluation AA

Intelligence Index

37.4

Tau2

1.0

Gpqa

0.9

Ifbench

0.7

Lcr

0.6

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.3

Scores par catégorie LLM Stats

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Physics

Healthcare

Biology

Chemistry

Language

Long Context

Math

Frontend Development

Code

Tool Calling

Vision

Factuality

Tarification

Prix d'entrée$0.14 / 1M tokens

Prix de sortie$0.28 / 1M tokens

Prix mixte (3:1)$0.175 / 1M tokens

Prix de lecture cache$0.0028 / 1M tokens

Vitesse

Tokens/sec0.0

Délai du premier token0.00s

Temps de réponse0.00s

Classement des Prix par Fournisseur

16 fournisseurs

Moins cher: OpenRouterPlus cher: routing.run

FournisseurEntréeSortie

1OpenRouterMoins cher

$0.09

$0.18

2Deep Infra

$0.1

$0.2

3GMI Cloud

$0.112

$0.224

4DeepSeekPRINCIPAL

$0.14

$0.28

5NanoGPT

$0.14

$0.28

6Fireworks AI

$0.14

$0.28

7Hugging Face

$0.14

$0.28

8ZenMux

$0.14

$0.28

9NovitaAI

$0.14

$0.28

10Kilo Gateway

$0.14

$0.28

11Nvidia

$0.14

$0.28

12SiliconFlow

$0.14

$0.28

13Vercel AI Gateway

$0.14

$0.28

14Merge Gateway

$0.14

$0.28

15OrcaRouter

$0.19

$0.37

16routing.run

$0.4928

$0.7392

Comparer les prix entre différents fournisseurs API pour ce modèle.

Sources externes

Artificial Analysis