DeepSeek V4 Flash (Reasoning, Max Effort)

DeepSeekDeepSeek開源權重MIT · 商用許可

描述

DeepSeek-V4-Flash-Max is the maximum reasoning effort mode of DeepSeek-V4-Flash, a 284B-parameter MoE model with 13B activated parameters and a 1M-token context window. Sharing the V4 series' hybrid attention architecture (Compressed Sparse Attention combined with Heavily Compressed Attention), Manifold-Constrained Hyper-Connections, and Muon optimizer, V4-Flash-Max delivers reasoning performance comparable to V4-Pro when given a larger thinking budget while operating at a fraction of the parameter scale. It is pre-trained on more than 32T tokens and post-trained with a two-stage paradigm of domain-specific expert cultivation followed by on-policy distillation.

發布日期

2026-04-24

參數規模

284.0B

上下文長度

1.0M

支援模態

text

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
程式碼能力榜	68	72.0	AA
通用能力榜	20	81.0	AA
科學能力	32	76.0	AA

基準測試分數 (LLM Stats)

Agents

GDPval-AA

1203.00 / 3000自報

BrowseComp

73.2%自報

MCP Atlas

69.0%自報

Terminal-Bench 2.0

56.9%自報

SWE-Bench Pro

52.6%自報

Toolathlon

47.8%自報

Biology

GPQA

88.1%自報

Code

LiveCodeBench

91.6%自報

SWE-Bench Verified

79.0%自報

SWE-bench Multilingual

73.3%自報

Factuality

SimpleQA

34.1%自報

Finance

MMLU-Pro

86.2%自報

General

CSimpleQA

78.9%自報

MRCR 1M

78.7%自報

CorpusQA 1M

60.5%自報

Math

CodeForces

1.00 / 3000自報

HMMT Feb 26

94.8%自報

IMO-AnswerBench

88.4%自報

MathArena Apex

85.7%自報

Humanity's Last Exam

45.1%自報

AA 評測指數

Coding Index

56.2

Intelligence Index

40.3

Tau2

1.0

Gpqa

0.9

Ifbench

0.8

Lcr

0.6

Terminalbench V2 1

0.6

Scicode

0.4

Terminalbench Hard

0.4

Hle

0.3

Tau Banking

0.2

LLM Stats 分類評分

Legal

100

Finance

100

Agents

100

General

100

Reasoning

Physics

Healthcare

Biology

Chemistry

Language

Long Context

Math

Frontend Development

Code

Tool Calling

Vision

Factuality

定價

輸入價格$0.14 / 1M tokens

輸出價格$0.28 / 1M tokens

混合價格(3:1)$0.175 / 1M tokens

快取讀取價格$0.0028 / 1M tokens

速度

Tokens/秒116.1

首Token延遲1.05s

首回答延遲49.40s

供應商價格排行

4 個供應商

最便宜: DeepSeek最貴: routing.run

供應商輸入輸出

1DeepSeek最便宜

2Poe

$0.14

$0.28

3AIHubMix

$0.14

$0.28

4routing.run

$0.4928

$0.7392

比較該模型在不同 API 供應商之間的定價。

外部連結

LLM Stats Artificial Analysis