DeepSeek V3.1 (Reasoning)

DeepSeekDeepSeek

説明

DeepSeek-V3.1 is a hybrid model supporting both thinking and non-thinking modes through different chat templates. Built on DeepSeek-V3.1-Base with a two-phase long context extension (32K phase: 630B tokens, 128K phase: 209B tokens), it features 671B total parameters with 37B activated. Key improvements include smarter tool calling through post-training optimization, higher thinking efficiency achieving comparable quality to DeepSeek-R1-0528 while responding more quickly, and UE8M0 FP8 scale data format for model weights and activations. The model excels in both reasoning tasks (thinking mode) and practical applications (non-thinking mode), with particularly strong performance in code agent tasks, math competitions, and search-based problem solving.

リリース日

2025-08-21

パラメータ

—

コンテキスト長

164K

モダリティ

text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Agents

Terminal-Bench

31.3%自己申告

BrowseComp

30.0%自己申告

Biology

GPQA

74.9%自己申告

Code

Aider-Polyglot

68.4%自己申告

SWE-Bench Verified

66.0%自己申告

LiveCodeBench

56.4%自己申告

SWE-bench Multilingual

54.5%自己申告

Factuality

SimpleQA

93.4%自己申告

Finance

MMLU-Pro

83.7%自己申告

General

MMLU-Redux

91.8%自己申告

Math

CodeForces

0.70 / 3000自己申告

AIME 2024

66.3%自己申告

AIME 2025

49.8%自己申告

HMMT 2025

33.5%自己申告

Humanity's Last Exam

15.9%自己申告

Reasoning

BrowseComp-zh

49.2%自己申告

AA評価指数

Math Index

89.7

Intelligence Index

20.7

Aime 25

0.9

Mmlu Pro

0.9

Livecodebench

0.8

Gpqa

0.8

Lcr

0.5

Ifbench

0.4

Scicode

0.4

Tau2

0.4

Terminalbench Hard

0.3

Hle

0.1

LLM Statsカテゴリスコア

Language

Factuality

Legal

Finance

General

Healthcare

Physics

Frontend Development

Biology

Chemistry

Math

Reasoning

Code

Agents

Vision

価格設定

入力価格$0.59 / 1Mトークン

出力価格$1.69 / 1Mトークン

混合価格（3:1）$0.865 / 1Mトークン

キャッシュ読み取り価格$0.13 / 1Mトークン

速度

トークン/秒0.0

初トークン遅延0.00s

初回答遅延0.00s

プロバイダー価格ランキング

3 プロバイダー

最安: Kilo Gateway最高: DeepSeek

プロバイダー入力出力

1Kilo Gateway最安

$0.15

$0.75

2OpenRouter

$0.21

$0.79

3DeepSeekプライマリ

$0.59

$1.69

このモデルの異なるAPIプロバイダー間の価格を比較。

外部リンク

Artificial Analysis

ドメイン	#順位	スコア	ソース
エージェント能力	116	31.0	LS
コーディングランキング	103	65.0	AA
総合ランキング	210	48.0	AA
数学的推論	35	91.0	AA
推論	93	49.0	LS
科学	137	56.0	AA