DeepSeek V4 Pro (Reasoning, Max Effort)
説明
DeepSeek-V4-Pro-Max is the maximum reasoning effort mode of DeepSeek-V4-Pro, a 1.6T-parameter MoE model with 49B activated parameters and a 1M-token context window. It introduces a hybrid attention architecture combining Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) for dramatically improved long-context efficiency, requiring only 27% of single-token inference FLOPs and 10% of KV cache compared with DeepSeek-V3.2 at 1M-token context. The model also incorporates Manifold-Constrained Hyper-Connections (mHC) for stable signal propagation and is trained with the Muon optimizer for faster convergence. Pre-trained on more than 32T tokens, V4-Pro-Max significantly advances open-source knowledge capabilities, achieves top-tier performance in coding benchmarks, and bridges the gap with leading closed-source models on reasoning and agentic tasks.
能力レーダー
専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。
ランキング
| ドメイン | #順位 | スコア | ソース |
|---|---|---|---|
| Agents & Tools | 28 | 64.0 | LS |
| Code Ranking | 19 | 81.0 | AA |
| General Ranking | 11 | 89.0 | AA |
| Science | 16 | 86.0 | AA |
ベンチマークスコア (LLM Stats)
Agents
Biology
Code
Factuality
Finance
General
Math
AA評価指数
LLM Statsカテゴリスコア
価格設定
速度
利用可能なプロバイダー
(LS内部単位)| プロバイダー | 入力価格 | 出力価格 |
|---|---|---|
| DeepSeek | 1.7M | 3.5M |
| DeepInfra | 1.7M | 3.5M |