
DeepSeek V3 (Dec '24)

DeepSeek · Open Weight · MIT + Model License (commercial use allowed)

Description

A powerful Mixture-of-Experts (MoE) language model with 671B total parameters (37B activated per token). Features Multi-head Latent Attention (MLA), auxiliary-loss-free load balancing, and multi-token prediction training. Pre-trained on 14.8T tokens with strong performance in reasoning, math, and code tasks.
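The gap between 671B total and 37B activated parameters comes from sparse expert routing: each token is sent to only a few of the model's experts. A minimal sketch of top-k routing, with an illustrative expert count and top_k rather than DeepSeek V3's actual configuration:

```python
import math
import random

# Hedged sketch of top-k routing in a Mixture-of-Experts (MoE) layer:
# each token scores all routed experts, but only the top_k highest-scoring
# experts actually run. n_experts and top_k here are illustrative values,
# not DeepSeek V3's real configuration.

random.seed(0)
n_experts, top_k = 8, 2

def route(router_logits):
    """Return (chosen expert ids, softmax weights over those experts)."""
    ranked = sorted(range(n_experts), key=lambda e: router_logits[e], reverse=True)
    chosen = ranked[:top_k]
    exps = [math.exp(router_logits[e]) for e in chosen]
    total = sum(exps)
    return chosen, [w / total for w in exps]

# One token's router logits (random stand-ins for a learned projection).
logits = [random.gauss(0, 1) for _ in range(n_experts)]
experts, weights = route(logits)
# Because only top_k of n_experts run per token, a model can hold 671B
# total parameters while activating only ~37B for any single token.
```

The per-token compute cost scales with the activated parameters, not the total, which is why the model is priced and benchmarked closer to a ~37B dense model than a 671B one.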

Release Date: 2024-12-26
Parameters: 671.0B
Context Length: 128K
Modalities: text

Capability Radar

general: 32
coding: 25
reasoning: 38
science (est.): 38
agents: 0
multimodal: 0

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain | Rank | Score | Source
Code | #273 | 29.0 | AA
General | #306 | 36.0 | AA
Math Reasoning | #226 | 39.0 | AA
Reasoning | #36 | 76.0 | LS
Science | #272 | 40.0 | AA

Benchmark Scores (LLM Stats)

Biology

GPQA: 59.1% (SR)

Code

Aider-Polyglot Edit: 79.7% (SR)
Aider-Polyglot: 49.6% (SR)
SWE-Bench Verified: 42.0% (SR)
LiveCodeBench: 37.6% (SR)

Factuality

SimpleQA: 24.9% (SR)

Finance

MMLU: 88.5% (SR)
MMLU-Pro: 75.9% (SR)

General

MMLU-Redux: 89.1% (SR)
C-Eval: 86.5% (SR)
IFEval: 86.1% (SR)
C-SimpleQA: 64.8% (SR)
LongBench v2: 48.7% (SR)

Language

CLUEWSC: 90.9% (SR)

Math

DROP: 91.6% (SR)
MATH-500: 90.2% (SR)
CNMO 2024: 43.2% (SR)
AIME 2024: 39.2% (SR)

Reasoning

HumanEval-Mul: 82.6% (SR)
FRAMES: 73.3% (SR)

AA Evaluation Indices

Math Index: 26.0
Intelligence Index: 16.5
Coding Index: 16.4
MATH-500: 0.9
MMLU-Pro: 0.8
GPQA: 0.6
LiveCodeBench: 0.4
SciCode: 0.4
IFBench: 0.3
LCR: 0.3
AIME 25: 0.3
AIME: 0.3
Tau2: 0.2
Terminal-Bench Hard: 0.1
HLE: 0.0

LLM Stats Category Scores

Instruction Following: 90
Finance: 80
Healthcare: 80
Language: 80
Legal: 80
Structured Output: 70
General: 70
Math: 70
Reasoning: 70
Biology: 60
Chemistry: 60
Physics: 60
Code: 50
Long Context: 50
Frontend Development: 40
Factuality: 20

Pricing

Input Price: $0.40 / 1M tokens
Output Price: $0.89 / 1M tokens
Blended Price (3:1 input:output): $0.523 / 1M tokens
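The blended figure is a weighted average of input and output prices, assuming three input tokens for every output token. A small sketch of that calculation (the function name and `ratio` parameter are illustrative, not an API from the listing site):

```python
# Hedged sketch of how a "Blended Price (3:1)" figure is typically derived:
# a weighted average assuming `ratio` input tokens per output token.

def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Blended $/1M tokens given `ratio` input tokens per output token."""
    return (ratio * input_price + output_price) / (ratio + 1.0)

# (3 * 0.40 + 0.89) / 4 = 2.09 / 4 = 0.5225, i.e. the listed $0.523 after rounding
print(f"${blended_price(0.40, 0.89):.4f} / 1M tokens")
```

Changing `ratio` models different workloads: chat-heavy use skews toward the output price, retrieval- or context-heavy use toward the input price.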

Speed

Tokens/sec: 0.0 tokens/s
Time to First Token: 0.00 s
Time to Answer: 0.00 s

Available Providers


No provider data available
