DeepSeek-R1-0120
deepseek-ai/DeepSeek-R1-0120
DeepSeek-R1 is a reasoning model powered by reinforcement learning (RL) that addresses the issues of repetition and readability. Prior to RL, DeepSeek-R1 incorporated cold-start data to further optimize its reasoning performance. It achieves performance comparable to OpenAI-o1 across math, code, and reasoning tasks, and through carefully designed training methods, it has enhanced overall effectiveness
Details
Model Provider
deepseek-ai
Type
text
Sub Type
chat
Size
671
Publish Time
Jan 20, 2025
Input Price
$
0.58
/ M Tokens
Output Price
$
2.29
/ M Tokens
Context length
65536
Tags
Reasoning,MoE,671B,64K