🎉 LongCat-2.0 tersedia di SiliconFlow. Coba SEKARANG.

Model-model

Produk

Harga

Dokumen

Blog

Tentang

Kontak

State-of-the-Art

AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

State-of-the-Art

AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

State-of-the-Art

AI Model Library

One API to run inference on 200+ cutting-edge AI models, and deploy in seconds

All

Featured

LLM

Vision

Image

Video

Audio

Serverless

Tencent

Tencent

Text Generation

Hunyuan-A13B-Instruct

Dirilis pada: 30 Jun 2025

Hunyuan-A13B-Instruct mengaktifkan hanya 13 B dari 80 B parameternya, namun sebanding dengan LLM yang jauh lebih besar pada tolok ukur arus utama. Ini menawarkan penalaran hibrida: mode “cepat” latensi rendah atau mode “lambat” presisi tinggi, dapat dialihkan per panggilan. Konteks 256 K-token asli memungkinkan untuk mencerna dokumen sepanjang buku tanpa degradasi. Keterampilan agen disesuaikan untuk kepemimpinan BFCL-v3, τ-Bench, dan C3-Bench, menjadikannya tulang punggung asisten otonom yang sangat baik. Grouped Query Attention plus kuantisasi multi-format memberikan inferensi yang ringan-memori, efisien-GPU untuk penerapan dunia nyata, dengan dukungan multibahasa bawaan dan penyelarasan keselamatan yang kuat untuk aplikasi kelas perusahaan....

Total Context:

131K

Max output:

131K

Input:

0.14

/ M Tokens

Input:

text

/ M Tokens

Output:

0.57

/ M Tokens

Tencent

Text Generation

Hy3

Dirilis pada: 26 Jun 2026

Built for real-world business scenarios, Hy3 features a 295B/21B active MoE architecture, native 256K context support, and three reasoning modes. It enhances coding, long-form comprehension, multi-turn dialogue, and agentic task execution, balancing reliability, efficiency, and cost across both high-frequency interactions and complex workflows....

Total Context:

262K

Max output:

262K

Input:

0.0

/ M Tokens

Input:

text

/ M Tokens

Output:

0.0

/ M Tokens

Tencent

Text Generation

Hy3-preview

Dirilis pada: 7 Apr 2026

Hy3 preview is a 295B-parameter Mixture-of-Experts (MoE) language model from Tencent Hunyuan, built for production-grade agent workloads. With only 21B parameters activated per token and native 256K context support, it handles complex tasks like cross-file code refactoring, long-document analysis, and multi-step tool use, rather than just generating fluent dialogue. Hy3 scores near state-of-the-art on SWE-bench Verified and advanced STEM benchmarks, while offering three inference modes (no_think, think_low, think_high) to dynamically trade off latency and reasoning depth. Its sparse activation architecture delivers competitive intelligence at a significantly lower token cost....

Total Context:

262K

Max output:

262K

Input:

0.066

/ M Tokens

Input:

text

/ M Tokens

Output:

0.26