Saltar al contenido principal

DeepSeek V4 Flash (Non-reasoning)

DeepSeekDeepSeek

Descripción

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

Fecha de lanzamiento
2026-04-24
Parámetros
Longitud del contexto
1.0M
Modalidades
text

Radar de capacidades

24
general
37
coding
72
reasoning
47
scienceest.
60
agents
0
multimodal

Science usa un proxy de razonamiento cuando los benchmarks científicos dedicados no están disponibles.

Rankings

Dominio#PosiciónPuntuaciónFuente
Capacidad agéntica52
56.0
LS
Ranking de codificación194
49.0
AA
Ranking general126
60.0
AA
Ciencia191
49.0
AA

Puntuaciones de benchmarks (LLM Stats)

Agents

GDPval-AA1203.00 / 3000Aut.
BrowseComp73.2%Aut.
MCP Atlas69.0%Aut.
Terminal-Bench 2.056.9%Aut.
SWE-Bench Pro52.6%Aut.
Toolathlon47.8%Aut.

Biology

GPQA88.1%Aut.

Code

LiveCodeBench91.6%Aut.
SWE-Bench Verified79.0%Aut.
SWE-bench Multilingual73.3%Aut.

Factuality

SimpleQA34.1%Aut.

Finance

MMLU-Pro86.2%Aut.

General

CSimpleQA78.9%Aut.
MRCR 1M78.7%Aut.
CorpusQA 1M60.5%Aut.

Math

CodeForces1.00 / 3000Aut.
HMMT Feb 2694.8%Aut.
IMO-AnswerBench88.4%Aut.
MathArena Apex85.7%Aut.
Humanity's Last Exam45.1%Aut.

Índices de evaluación AA

Intelligence Index
28.7
Tau2
0.9
Gpqa
0.7
Ifbench
0.5
Scicode
0.4
Terminalbench Hard
0.3
Lcr
0.3
Hle
0.1

Puntuaciones por categoría LLM Stats

Legal
100
Finance
100
Agents
100
General
100
Reasoning
68
Physics
90
Healthcare
90
Biology
90
Chemistry
90
Language
80
Long Context
80
Math
80
Frontend Development
80
Search
70
Code
70
Tool Calling
60
Vision
50
Factuality
30

Precios

Precio de entrada$0.14 / 1M tokens
Precio de salida$0.28 / 1M tokens
Precio mixto (3:1)$0.175 / 1M tokens
Precio de lectura caché$0.0028 / 1M tokens

Velocidad

Tokens/seg120.2
Retraso del primer token1.07s
Tiempo hasta la respuesta1.07s

Ranking de Precios por Proveedor

Ranking de Precios por Proveedor

11 proveedores

Más barato: CrofAIMás caro: Azure
ProveedorEntradaSalida
1CrofAIMás barato
$0.12
$0.21
2Cortecs
$0.133
$0.266
3DeepSeekPRINCIPAL
$0.14
$0.28
4OpenCode Go
$0.14
$0.28
5Alibaba (China)
$0.14
$0.28
6OpenCode Zen
$0.14
$0.28
7Wafer
$0.14
$0.28
8LLM Gateway
$0.14
$0.28
9Auriko
$0.14
$0.28
10Venice AI
$0.17
$0.35
11Azure
$0.19
$0.51

Comparar precios entre diferentes proveedores de API para este modelo.

Fuentes externas