DeepSeek VL2 Small

DeepSeekDeepSeek開源權重deepseek

描述

An advanced series of large Mixture-of-Experts (MoE) Vision-Language Models that significantly improves upon its predecessor, DeepSeek-VL. DeepSeek-VL2 demonstrates superior capabilities across various tasks, including but not limited to visual question answering, optical character recognition, document/table/chart understanding, and visual grounding.

發布日期

2024-12-13

參數規模

16.0B

上下文長度

—

支援模態

—

能力雷達圖

general

coding

reasoning

science估算

agents

multimodal

Science 在缺少專門科學評測時使用推理能力代理估算。

排行榜排名

領域	#排名	分數	來源
多模態榜	53	75.0	LS

基準測試分數 (LLM Stats)

General

MMT-Bench

62.9%自報

MMStar

57.0%自報

MMMU

48.0%自報

Image To Text

DocVQA

92.3%自報

TextVQA

83.4%自報

OCRBench

83.4%自報

Math

MathVista

60.7%自報

Multimodal

ChartQA

84.5%自報

MMBench

80.3%自報

AI2D

80.0%自報

MMBench-V1.1

79.3%自報

InfoVQA

75.8%自報

MME

21.2%自報

Spatial Reasoning

RealWorldQA

65.4%自報

AA 評測指數

暫無 AA 評測資料

LLM Stats 分類評分

Image To Text

Multimodal

Spatial Reasoning

Vision

Math

Reasoning

General

Healthcare

定價

暫無定價資料

速度

暫無速度資料

供應商價格排行

暫無提供商資料

外部連結

LLM Stats Artificial Analysis