DeepSeek V3.1 (Reasoning)

DeepSeekDeepSeek

설명

DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with a two-phase long context extension (32K phase: 630B tokens, 128K phase: 209B tokens), it features 671B total parameters with 37B activated. Key improvements include smarter tool calling through post-training optimization, higher thinking efficiency achieving comparable quality to DeepSeek-R1-0528 while responding more quickly, and UE8M0 FP8 scale data format for model weights and activations. The model excels in both reasoning tasks (thinking mode) and practical applications (non-thinking mode), with particularly strong performance in code agent tasks, math competitions, and search-based problem solving.

출시일

2025-08-21

파라미터

—

컨텍스트 길이

164K

모달리티

text

능력 레이더

general

coding

reasoning

science추정

agents

multimodal

전용 과학 벤치마크가 없을 때 Science는 추론 프록시를 사용하여 추정합니다.

랭킹

도메인	#순위	점수	소스
에이전트형 역량	116	31.0	LS
코딩 랭킹	103	65.0	AA
종합 랭킹	210	48.0	AA
수학 추론	35	91.0	AA
추론	93	49.0	LS
과학	137	56.0	AA

벤치마크 점수 (LLM Stats)

Agents

Terminal-Bench

31.3%자체 보고

BrowseComp

30.0%자체 보고

Biology

GPQA

74.9%자체 보고

Code

Aider-Polyglot

68.4%자체 보고

SWE-Bench Verified

66.0%자체 보고

LiveCodeBench

56.4%자체 보고

SWE-bench Multilingual

54.5%자체 보고

Factuality

SimpleQA

93.4%자체 보고

Finance

MMLU-Pro

83.7%자체 보고

General

MMLU-Redux

91.8%자체 보고

Math

CodeForces

0.70 / 3000자체 보고

AIME 2024

66.3%자체 보고

AIME 2025

49.8%자체 보고

HMMT 2025

33.5%자체 보고

Humanity's Last Exam

15.9%자체 보고

Reasoning

BrowseComp-zh

49.2%자체 보고

AA 평가 지수

Math Index

89.7

Intelligence Index

20.7

Aime 25

0.9

Mmlu Pro

0.9

Livecodebench

0.8

Gpqa

0.8

Lcr

0.5

Ifbench

0.4

Scicode

0.4

Tau2

0.4

Terminalbench Hard

0.3

Hle

0.1

LLM Stats 카테고리 점수

Language

Factuality

Legal

Finance

General

Healthcare

Physics

Frontend Development

Biology

Chemistry

Math

Reasoning

Code

Agents

Vision

가격

입력 가격$0.59 / 1M 토큰

출력 가격$1.69 / 1M 토큰

혼합 가격 (3:1)$0.865 / 1M 토큰

캐시 읽기 가격$0.13 / 1M 토큰

속도

토큰/초0.0

첫 토큰 지연0.00s

첫 응답 지연0.00s

공급자 가격 순위

3개 공급자

최저가: Kilo Gateway최고가: DeepSeek

공급자입력출력

1Kilo Gateway최저가

$0.15

$0.75

2OpenRouter

$0.21

$0.79

3DeepSeek주요

$0.59

$1.69

이 모델의 다양한 API 공급자 간 가격 비교.

외부 링크

Artificial Analysis