DeepSeek-V2.5 (Dec '24)

DeepSeek, open weights

Description

DeepSeek-V2.5 is an upgraded version that combines DeepSeek-V2-Chat and DeepSeek-Coder-V2-Instruct, integrating general and coding abilities. It better aligns with human preferences and has been optimized in various aspects, including writing and instruction following.

Release date

2024-12-10

Parameters

236.0B

Context length

—

Modality

text

Capability radar

Axes: general, coding, reasoning, science (estimated), agents, multimodal

When no dedicated science benchmark is available, the Science score is estimated from a reasoning proxy.

Rankings

Domain	Rank	Score	Source
Overall ranking	505	10.0	AA
Math reasoning	104	75.0	AA

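Benchmark scores (LLM Stats)

Category	Benchmark	Score	Source
Code	HumanEval	89.0%	Self-reported
Code	Aider	72.2%	Self-reported
Code	SWE-Bench Verified	16.8%	Self-reported
Communication	MT-Bench	0.90 / 100	Self-reported
Creativity	AlignBench	80.4%	Self-reported
Creativity	Arena Hard	76.2%	Self-reported
Creativity	AlpacaEval 2.0	50.5%	Self-reported
Finance	MMLU	80.4%	Self-reported
General	DS-FIM-Eval	78.3%	Self-reported
General	LiveCodeBench(01-09)	41.8%	Self-reported
Language	BBH	84.3%	Self-reported
Math	GSM8k	95.1%	Self-reported
Math	MATH	74.7%	Self-reported
Reasoning	HumanEval-Mul	73.8%	Self-reported
Reasoning	DS-Arena-Code	63.1%	Self-reported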

AA evaluation indices

Intelligence Index

6.8

Math 500

0.8

LLM Stats category scores

Categories charted: Roleplay, Communication, Language, Legal, Math, Finance, General, Healthcare, Reasoning, Creativity, Writing, Code, Frontend Development

Pricing

Input price: Free

Output price: Free

Blended price (3:1): Free
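
The page does not spell out how the blended price (3:1) is derived. A minimal sketch, assuming the ratio means three input tokens for every output token and that the blend is a weighted average of the two per-token prices; the function name `blended_price` and the non-zero sample prices are hypothetical and not taken from this page:

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of input and output prices (e.g. USD per 1M tokens).

    Assumption: a "3:1" blend weights the input price 3 and the output price 1.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Input and output are both listed as free here, so the blend is also zero.
print(blended_price(0.0, 0.0))

# Hypothetical non-zero prices, for illustration only.
print(blended_price(1.0, 3.0))
```

Under this assumption the blended figure always lies between the input and output prices, and sits closer to the input price because of its larger weight.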

Speed

Tokens/second: 0.0

Time to first token: 0.00s

Time to first response: 0.00s

Provider price ranking

No provider data available.

External links

LLM Stats, Artificial Analysis