Saltar al contenido principal

DeepSeek V4 Flash (Reasoning, High Effort)

DeepSeekDeepSeek

Descripción

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Fecha de lanzamiento
2026-04-24
Parámetros
Longitud del contexto
1.0M
Modalidades
text

Radar de capacidades

35
general
42
coding
87
reasoning
59
scienceest.
60
agents
0
multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio#PosiciónPuntuaciónFuente
Ranking de codificación72
72.0
AA
Ranking general40
77.0
AA
Ciencia46
71.0
AA

Puntuaciones de benchmarks (LLM Stats)

Agents

GDPval-AA1203.00 / 3000Aut.
BrowseComp73.2%Aut.
MCP Atlas69.0%Aut.
Terminal-Bench 2.056.9%Aut.
SWE-Bench Pro52.6%Aut.
Toolathlon47.8%Aut.

Biology

GPQA88.1%Aut.

Code

LiveCodeBench91.6%Aut.
SWE-Bench Verified79.0%Aut.
SWE-bench Multilingual73.3%Aut.

Factuality

SimpleQA34.1%Aut.

Finance

MMLU-Pro86.2%Aut.

General

CSimpleQA78.9%Aut.
MRCR 1M78.7%Aut.
CorpusQA 1M60.5%Aut.

Math

CodeForces1.00 / 3000Aut.
HMMT Feb 2694.8%Aut.
IMO-AnswerBench88.4%Aut.
MathArena Apex85.7%Aut.
Humanity's Last Exam45.1%Aut.

Índices de evaluación AA

Intelligence Index
37.4
Tau2
1.0
Gpqa
0.9
Ifbench
0.7
Lcr
0.6
Scicode
0.4
Terminalbench Hard
0.4
Hle
0.3

Puntuaciones por categoría LLM Stats

Legal
100
Finance
100
Agents
100
General
100
Reasoning
68
Physics
90
Healthcare
90
Biology
90
Chemistry
90
Language
80
Long Context
80
Math
80
Frontend Development
80
Search
70
Code
70
Tool Calling
60
Vision
50
Factuality
30

Precios

Precio de entrada$0.14 / 1M tokens
Precio de salida$0.28 / 1M tokens
Precio mixto (3:1)$0.175 / 1M tokens
Precio de lectura caché$0.0028 / 1M tokens

Velocidad

Tokens/seg0.0
Retraso del primer token0.00s
Tiempo hasta la respuesta0.00s

Ranking de Precios por Proveedor

Ranking de Precios por Proveedor

16 proveedores

Más barato: OpenRouterMás caro: routing.run
ProveedorEntradaSalida
1OpenRouterMás barato
$0.09
$0.18
2Deep Infra
$0.1
$0.2
3GMI Cloud
$0.112
$0.224
4DeepSeekPRINCIPAL
$0.14
$0.28
5NanoGPT
$0.14
$0.28
6Fireworks AI
$0.14
$0.28
7Hugging Face
$0.14
$0.28
8ZenMux
$0.14
$0.28
9NovitaAI
$0.14
$0.28
10Kilo Gateway
$0.14
$0.28
11Nvidia
$0.14
$0.28
12SiliconFlow
$0.14
$0.28
13Vercel AI Gateway
$0.14
$0.28
14Merge Gateway
$0.14
$0.28
15OrcaRouter
$0.19
$0.37
16routing.run
$0.4928
$0.7392

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas