
DeepSeek VL2 Tiny

DeepSeek · Open Weight

Description

DeepSeek-VL2 is a series of large Mixture-of-Experts (MoE) vision-language models that improves on its predecessor, DeepSeek-VL. It is reported to perform strongly across tasks including visual question answering, optical character recognition, document, table, and chart understanding, and visual grounding.

Release Date: 2024-12-13
Parameters: 3.0B
Context Length: (not listed)
Modalities: (not listed)

Capability Radar

General: 50
Coding: 0
Reasoning: 50
Science (est.): 34
Agents: 35
Multimodal: 80

Science uses a reasoning proxy when dedicated science benchmarks are unavailable.

Rankings

Domain              Rank  Score  Source
Multimodal Ranking  #72   69.0   LS

Benchmark Scores (LLM Stats)

General

MMT-Bench: 53.2% (SR)
MMStar: 45.9% (SR)
MMMU: 40.7% (SR)

Image To Text

DocVQA: 88.9% (SR)
OCRBench: 80.9% (SR)
TextVQA: 80.7% (SR)

Math

MathVista: 53.6% (SR)

Multimodal

ChartQA: 81.0% (SR)
AI2D: 71.6% (SR)
MMBench: 69.2% (SR)
MMBench-V1.1: 68.3% (SR)
InfoVQA: 66.1% (SR)
MME: 19.1% (SR)

Spatial Reasoning

RealWorldQA: 64.2% (SR)

AA Evaluation Indices

No AA evaluation data available

LLM Stats Category Scores

Image To Text: 80
Multimodal: 60
Reasoning: 60
Spatial Reasoning: 60
Vision: 60
Math: 50
General: 50
Healthcare: 40
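
The page does not say how category scores are derived from individual benchmarks. As an observation only, the five categories that have benchmarks listed on this page (General, Image To Text, Math, Multimodal, Spatial Reasoning) are consistent with taking the mean of that category's benchmark scores and rounding to the nearest ten. The sketch below checks that arithmetic. The function name `category_score` and the rounding step are assumptions made for illustration, not a documented LLM Stats method, and the rule cannot be checked for Reasoning, Vision, or Healthcare because no benchmarks are listed under those names.

```python
from statistics import mean

# Benchmark scores (percent) copied from this page, grouped by category.
BENCHMARKS = {
    "General": [53.2, 45.9, 40.7],
    "Image To Text": [88.9, 80.9, 80.7],
    "Math": [53.6],
    "Multimodal": [81.0, 71.6, 69.2, 68.3, 66.1, 19.1],
    "Spatial Reasoning": [64.2],
}


def category_score(scores, step=10):
    """Hypothetical aggregation: mean of the scores, rounded to the nearest `step`.

    This rule is an assumption inferred from the numbers on the page.
    """
    return int(round(mean(scores) / step) * step)


if __name__ == "__main__":
    for name, scores in BENCHMARKS.items():
        print(f"{name}: mean {mean(scores):.2f} -> {category_score(scores)}")
```

Under this assumed rule the computed values match the listed category scores for these five categories (50, 80, 50, 60, 60). A match on five data points does not confirm the actual method.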

Pricing

No pricing data available

Speed

No speed data available

Provider Price Ranking

No provider data available
