DeepSeek V3.1 (Reasoning)
설명
DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with a two-phase long context extension (32K phase: 630B tokens, 128K phase: 209B tokens), it features 671B total parameters with 37B activated. Key improvements include smarter tool calling through post-training optimization, higher thinking efficiency achieving comparable quality to DeepSeek-R1-0528 while responding more quickly, and UE8M0 FP8 scale data format for model weights and activations. The model excels in both reasoning tasks (thinking mode) and practical applications (non-thinking mode), with particularly strong performance in code agent tasks, math competitions, and search-based problem solving.
능력 레이더
전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.
랭킹
벤치마크 점수 (LLM Stats)
Agents
Biology
Code
Factuality
Finance
General
Math
Reasoning
AA 평가 지수
LLM Stats 카테고리 점수
가격
속도
공급자 가격 순위
공급자 가격 순위
3개 공급자
이 모델의 다양한 API 공급자 간 가격 비교.