DeepSeek VL2 Tiny
DeepSeekDeepSeekOpen Weightdeepseek
Description
An advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL. DeepSeek-VL2 demonstrates superior capabilities across various tasks, including but not limited to visual question answering, optical character recognition, document/table/chart understanding, and visual grounding.
Release Date
2024-12-13
Parameters
3.0B
Context Length
164K
Modalities
text
Capability Radar
50
general
0
coding
50
reasoning
34
scienceest.
0
agents
0
multimodal
Science uses a reasoning proxy when dedicated science benchmarks are unavailable.
Rankings
| Domain | #Rank | Score | Source |
|---|---|---|---|
| Multimodal Ranking | 63 | 69.0 | LS |
Benchmark Scores (LLM Stats)
General
MMT-Bench
53.2%SR
MMStar
45.9%SR
MMMU
40.7%SR
Image To Text
DocVQA
88.9%SR
OCRBench
80.9%SR
TextVQA
80.7%SR
Math
MathVista
53.6%SR
Multimodal
ChartQA
81.0%SR
AI2D
71.6%SR
MMBench
69.2%SR
MMBench-V1.1
68.3%SR
InfoVQA
66.1%SR
MME
19.1%SR
Spatial Reasoning
RealWorldQA
64.2%SR
AA Evaluation Indices
No AA evaluation data available
LLM Stats Category Scores
Image To Text80
Spatial Reasoning60
Vision60
Multimodal60
Reasoning60
General50
Math50
Healthcare40
Pricing
Input Price$0.32 / 1M tokens
Output Price$0.89 / 1M tokens
Blended Price (3:1)$0.4625 / 1M tokens
Speed
No speed data available
Available Providers
(LS internal units)No provider data available