DeepSeek V4 Pro (Reasoning, High Effort)
Описание
DeepSeek-V4-Pro-Max is the maximum reasoning effort mode of DeepSeek-V4-Pro, a 1.6T-parameter MoE model with 49B activated parameters and a 1M-token context window. It introduces a hybrid attention architecture combining Compressed Sparse Attention (CSA) and Heavily Compressed Attention (HCA) for dramatically improved long-context efficiency, requiring only 27% of single-token inference FLOPs and 10% of KV cache compared with DeepSeek-V3.2 at 1M-token context. The model also incorporates Manifold-Constrained Hyper-Connections (mHC) for stable signal propagation and is trained with the Muon optimizer for faster convergence. Pre-trained on more than 32T tokens, V4-Pro-Max significantly advances open-source knowledge capabilities, achieves top-tier performance in coding benchmarks, and bridges the gap with leading closed-source models on reasoning and agentic tasks.
Радар способностей
Science использует прокси на основе рассуждений, когда специализированные научные бенчмарки недоступны.
Рейтинги
| Домен | #Место | Оценка | Источник |
|---|---|---|---|
| Рейтинг кодинга | 54 | 76.0 | AA |
| Общий рейтинг | 26 | 79.0 | AA |
| Наука | 28 | 78.0 | AA |
Оценки бенчмарков (LLM Stats)
Agents
Biology
Code
Factuality
Finance
General
Math
Индексы оценки AA
Оценки категорий LLM Stats
Цены
Скорость
Рейтинг цен провайдеров
Рейтинг цен провайдеров
1 провайдеров
Сравнение цен разных API-провайдеров для этой модели.