DeepSeek-V2.5 (Dec '24)

DeepSeekDeepSeekオープンウエイトdeepseek

説明

DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct, integrating general and coding abilities. It better aligns with human preferences and has been optimized in various aspects, including writing and instruction following.

リリース日

2024-12-10

パラメータ

236.0B

コンテキスト長

—

モダリティ

text

能力レーダー

general

coding

reasoning

science推定

agents

multimodal

専門的な科学ベンチマークが利用できない場合、Scienceは推論プロキシを使用して推定します。

ベンチマークスコア (LLM Stats)

Code

HumanEval

89.0%自己申告

Aider

72.2%自己申告

SWE-Bench Verified

16.8%自己申告

Communication

MT-Bench

0.90 / 100自己申告

Creativity

AlignBench

80.4%自己申告

Arena Hard

76.2%自己申告

AlpacaEval 2.0

50.5%自己申告

Finance

MMLU

80.4%自己申告

General

DS-FIM-Eval

78.3%自己申告

LiveCodeBench(01-09)

41.8%自己申告

Language

BBH

84.3%自己申告

Math

GSM8k

95.1%自己申告

MATH

74.7%自己申告

Reasoning

HumanEval-Mul

73.8%自己申告

DS-Arena-Code

63.1%自己申告

AA評価指数

Intelligence Index

6.8

Math 500

0.8

LLM Statsカテゴリスコア

Roleplay

Communication

Language

Legal

Math

Finance

General

Healthcare

Reasoning

Creativity

Writing

Code

Frontend Development

価格設定

入力価格無料

出力価格無料

混合価格（3:1）無料

速度

トークン/秒0.0

初トークン遅延0.00s

初回答遅延0.00s

プロバイダー価格ランキング

プロバイダーデータがありません

外部リンク

LLM Stats Artificial Analysis

ドメイン	#順位	スコア	ソース
総合ランキング	505	10.0	AA
数学的推論	104	75.0	AA