deepseek-vl2
deepseek-ai/deepseek-vl2
DeepSeek-VL2 is a mixed-expert (MoE) vision-language model developed based on DeepSeekMoE-27B, employing a sparse-activated MoE architecture to achieve superior performance with only 4.5B active parameters. The model excels in various tasks including visual question answering, optical character recognition, document/table/chart understanding, and visual grounding. Compared to existing open-source dense models and MoE-based models, it demonstrates competitive or state-of-the-art performance using the same or fewer active parameters.
Details
Model Provider
deepseek-ai
Type
text
Sub Type
chat
Size
-1
Publish Time
Dec 13, 2024
Input Price
$
0.15
/ M Tokens
Output Price
$
0.15
/ M Tokens
Context length
4096
Tags
MoE,27B,4K