DeepSeek-R1-Distill-Qwen-32B
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B
DeepSeek-R1-Distill-Qwen-32B is a distilled model based on Qwen2.5-32B. It was fine-tuned on 800k curated samples generated by DeepSeek-R1 and performs strongly on mathematics, programming, and reasoning tasks. Its reported benchmarks include AIME 2024, MATH-500, and GPQA Diamond, with 94.3% accuracy on MATH-500.
Details
Model Provider
deepseek-ai
Type
text
Sub Type
chat
Size
32B
Publish Time
Jan 20, 2025
Input Price
$0.18 / M Tokens
Output Price
$0.18 / M Tokens
Context length
32768 tokens
Tags
Reasoning,32B,32K
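The prices and context length listed above can be combined into a rough per-request cost estimate. The sketch below is illustrative only: the function name `estimate_cost` is hypothetical, and it assumes that billing is strictly per token with no minimum charge and that the prompt and the completion share the 32768-token context window. Neither assumption is stated in the listing.

```python
# Values copied from the listing above.
INPUT_PRICE_PER_M = 0.18   # USD per million input tokens
OUTPUT_PRICE_PER_M = 0.18  # USD per million output tokens
CONTEXT_LENGTH = 32768     # tokens


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of a single request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    # Assumption: prompt and completion together must fit in the context window.
    if input_tokens + output_tokens > CONTEXT_LENGTH:
        raise ValueError("request exceeds the 32768-token context length")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # 20,000 prompt tokens plus 10,000 generated tokens.
    print(f"${estimate_cost(20_000, 10_000):.4f}")
```

Because reasoning models tend to produce long outputs, the output-token term may dominate the cost in practice even though both prices are equal here.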