DeepSeek V4 Flash (Reasoning, Max Effort)

DeepSeekDeepSeekOpen WeightMIT · Commercial OK

Descripción

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Fecha de lanzamiento

2026-04-24

Parámetros

284.0B

Longitud del contexto

1.0M

Modalidades

text

Radar de capacidades

general

coding

reasoning

scienceest.

agents

multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio	#Posición	Puntuación	Fuente
Agents & Tools	49	57.0	LS
Code Ranking	64	68.0	AA
General Ranking	24	85.0	AA
Science	25	80.0	AA

Puntuaciones de benchmarks (LLM Stats)

Agents

GDPval-AA

1395.00 / 3000Aut.

BrowseComp

73.2%Aut.

MCP Atlas

69.0%Aut.

Terminal-Bench 2.0

56.9%Aut.

SWE-Bench Pro

52.6%Aut.

Toolathlon

47.8%Aut.

Biology

GPQA

88.1%Aut.

Code

LiveCodeBench

91.6%Aut.

SWE-Bench Verified

79.0%Aut.

SWE-bench Multilingual

73.3%Aut.

Factuality

SimpleQA

34.1%Aut.

Finance

MMLU-Pro

86.2%Aut.

General

CSimpleQA

78.9%Aut.

MRCR 1M

78.7%Aut.

CorpusQA 1M

60.5%Aut.

Math

CodeForces

1.00 / 3000Aut.

HMMT Feb 26

94.8%Aut.

IMO-AnswerBench

88.4%Aut.

MathArena Apex

85.7%Aut.

Humanity's Last Exam

45.1%Aut.

Índices de evaluación AA

Intelligence Index

46.5

Coding Index

38.7

Tau2

0.9

Gpqa

0.9

Ifbench

0.8

Lcr

0.6

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.3

Puntuaciones por categoría LLM Stats

Finance

100

Legal

100

Agents

100

General

100

Reasoning

Biology

Chemistry

Healthcare

Physics

Frontend Development

Language

Long Context

Math

Code

Tool Calling

Vision

Factuality

Precios

Precio de entrada$0.14 / 1M tokens

Precio de salida$0.28 / 1M tokens

Precio mixto (3:1)$0.175 / 1M tokens

Velocidad

Tokens/seg74.3 tokens/s

Retraso del primer token0.82s

Tiempo hasta la respuesta76.34s

Proveedores disponibles

(Unidades internas LS)

Proveedor	Precio de entrada	Precio de salida
DeepSeek	140K	280K

Fuentes externas

LLM Stats