DeepSeek V3.1 (Non-reasoning)
설명
DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with a two-phase long context extension (32K phase: 630B tokens, 128K phase: 209B tokens), it features 671B total parameters with 37B activated. Key improvements include smarter tool calling through post-training optimization, higher thinking efficiency achieving comparable quality to DeepSeek-R1-0528 while responding more quickly, and UE8M0 FP8 scale data format for model weights and activations. The model excels in both reasoning tasks (thinking mode) and practical applications (non-thinking mode), with particularly strong performance in code agent tasks, math competitions, and search-based problem solving.
능력 레이더
전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.
랭킹
| 도메인 | #순위 | 점수 | 소스 |
|---|---|---|---|
| Agents & Tools | 95 | 31.0 | LS |
| Code Ranking | 138 | 52.0 | AA |
| General Ranking | 208 | 49.0 | AA |
| Math Reasoning | 183 | 50.0 | AA |
| Reasoning | 88 | 49.0 | LS |
| Science | 179 | 50.0 | AA |
벤치마크 점수 (LLM Stats)
Agents
Biology
Code
Factuality
Finance
General
Math
Reasoning
AA 평가 지수
LLM Stats 카테고리 점수
가격
속도
사용 가능한 프로바이더
(LS 내부 단위)| 프로바이더 | 입력 가격 | 출력 가격 |
|---|---|---|
| Novita | 270K | 1.0M |
| DeepInfra | 270K | 1.0M |