Model-model

Produk

Harga

Dokumen

Blog

Tentang

Kontak

🎉 LongCat-2.0 tersedia di SiliconFlow. Coba SEKARANG.

🎉 LongCat-2.0 tersedia di SiliconFlow. Coba SEKARANG.

Model-model

DeepSeek-R1-Distill-Qwen-1.5B

DeepSeek-R1-Distill-Qwen-1.5B

Referensi API

Tentang DeepSeek-R1-Distill-Qwen-1.5B

DeepSeek-R1-Distill-Qwen-1.5B adalah model distilled yang didasarkan pada Qwen2.5-Math-1.5B. Model ini di-tuning dengan baik menggunakan 800k sampel dikurasi yang dihasilkan oleh DeepSeek-R1 dan menunjukkan kinerja yang cukup baik di berbagai tolok ukur. Sebagai model ringan, model ini mencapai akurasi 83.9% pada MATH-500, tingkat kelulusan 28.9% pada AIME 2024, dan peringkat 954 di CodeForces, menunjukkan kemampuan penalaran yang melampaui skala parameternya

Kasus Penggunaan

Metadata

Buat di

20 Jan 2025

Lisensi

MIT

Penyedia

DeepSeek

HuggingFace

DeepSeek-R1-Distill-Qwen-1.5B

Spesifikasi

Negara

Deprecated

Arsitektur

Terkalibrasi

Tidak

Campuran Ahli

Tidak

Total Parameter

2B

Parameter yang Diaktifkan

Penalaran

Tidak

Precision

FP8

Text panjang konteks

33K

Max Tokens

Bandingkan dengan Model Lain

Lihat bagaimana model ini dibandingkan dengan yang lain.

DeepSeek

chat

DeepSeek-V3.2

Dirilis pada: 4 Des 2025

Total Context:

164K

Max output:

164K

Input:

$

0.27

/ M Tokens

Output:

$

0.42

/ M Tokens

DeepSeek

chat

DeepSeek-V3.2-Exp

Dirilis pada: 10 Okt 2025

Total Context:

164K

Max output:

164K

Input:

$

0.27

/ M Tokens

Output:

$

0.41

/ M Tokens

DeepSeek

chat

DeepSeek-V3.1-Terminus

Dirilis pada: 29 Sep 2025

Total Context:

164K

Max output:

164K

Input:

$

0.27

/ M Tokens

Output:

$

1.0

/ M Tokens

DeepSeek

chat

DeepSeek-V3.1

Dirilis pada: 25 Agu 2025

Total Context:

164K

Max output:

164K

Input:

$

0.27

/ M Tokens

Output:

$

1.0

/ M Tokens

DeepSeek

chat

DeepSeek-V3

Dirilis pada: 26 Des 2024

Total Context:

164K

Max output:

164K

Input:

$

0.25

/ M Tokens

Output:

$

1.0

/ M Tokens

DeepSeek

chat

DeepSeek-R1

Dirilis pada: 28 Mei 2025

Total Context:

164K

Max output:

164K

Input:

$

0.5

/ M Tokens

Output:

$

2.18

/ M Tokens

DeepSeek

chat

DeepSeek-R1-Distill-Qwen-32B

Dirilis pada: 20 Jan 2025

Total Context:

131K

Max output:

131K

Input:

$

0.18

/ M Tokens

Output:

$

0.18

/ M Tokens

DeepSeek

chat

DeepSeek-R1-Distill-Qwen-14B

Dirilis pada: 20 Jan 2025

Total Context:

131K

Max output:

131K

Input:

$

0.1

/ M Tokens

Output:

$

0.1

/ M Tokens

DeepSeek

chat

DeepSeek-R1-Distill-Qwen-7B

Dirilis pada: 20 Jan 2025

Total Context:

33K

Max output:

16K

Input:

$

0.05

/ M Tokens

Output:

$

0.05

/ M Tokens

Siap untuk mempercepat pengembangan AI Anda?

Siap untuk mempercepat pengembangan AI Anda?

Siap untuk mempercepat pengembangan AI Anda?

HALAMAN

MODEL

PRODUK

GPU yang Dipesan

© 2025 SiliconFlow

·

HALAMAN

MODEL

PRODUK

GPU yang Dipesan

© 2025 SiliconFlow

·

HALAMAN

MODEL

PRODUK

GPU yang Dipesan

© 2025 SiliconFlow

·