DeepSeek-R1-Distill-Llama-70B
deepseek-ai/DeepSeek-R1-Distill-Llama-70B
DeepSeek-R1-Distill-Llama-70B is a distilled model based on Llama-3.3-70B-Instruct. As part of the DeepSeek-R1 series, it was fine-tuned using samples generated by DeepSeek-R1 and demonstrates excellent performance across mathematics, programming, and reasoning tasks. The model achieved impressive results in various benchmarks including AIME 2024, MATH-500, and GPQA Diamond, showcasing its strong reasoning capabilities
Details
Model Provider
deepseek-ai
Type
text
Sub Type
chat
Size
70
Publish Time
Jan 20, 2025
Input Price
$
0.59
/ M Tokens
Output Price
$
0.59
/ M Tokens
Context length
32768
Tags
Reasoning,70B,32K