DeepSeek-V3
deepseek-ai/DeepSeek-V3
The new version of DeepSeek-V3 (DeepSeek-V3-0324) utilizes the same base model as the previous DeepSeek-V3-1226, with improvements made only to the post-training methods. The new V3 model incorporates reinforcement learning techniques from the training process of the DeepSeek-R1 model, significantly enhancing its performance on reasoning tasks. It has achieved scores surpassing GPT-4.5 on evaluation sets related to mathematics and coding. Additionally, the model has seen notable improvements in tool invocation, role-playing, and casual conversation capabilities.
Details
Model Provider
deepseek-ai
Type
text
Sub Type
chat
Size
671
Publish Time
Dec 26, 2024
Input Price
$
0.29
/ M Tokens
Output Price
$
1.15
/ M Tokens
Context length
65536
Tags
MoE,671B,64K