🎉 Kimi-K3可在 SiliconFlow 上使用。現在就試試看。

模型

產品

定價

文檔

部落格

關於

聯繫

最先進的

人工智能模型庫

一個 API 可以運行 200 多個尖端 AI 模型，並在幾秒鐘內部署

最先進的

人工智能模型庫

一個 API 可以運行 200 多個尖端 AI 模型，並在幾秒鐘內部署

最先進的

人工智能模型庫

一個 API 可以運行 200 多個尖端 AI 模型，並在幾秒鐘內部署

All

Featured

LLM

Vision

Image

Video

Audio

Serverless

Tencent

Tencent

Text Generation

Hunyuan-A13B-Instruct

發行日期：2025年6月30日

Hunyuan-A13B-Instruct 僅啟用其 80 B 參數中的 13 B，卻能在主流基準上匹敵更大的 LLMs。它提供混合推理：每次呼叫可切換為低延遲“快速”模式或高精度“慢速”模式。內建 256 K-token 上下文，允許它在不減低功效的情況下解析書籍長度的文件。代理技能為 BFCL-v3、τ-Bench 和 C3-Bench 領導力而調校，使其成為優秀的自主助手基礎。分組查詢注意力和多格式量化提供記憶體輕量、GPU 高效的推理，適合現實世界的部署，並具備內建多語言支持和堅固的安全對齊，適用於企業級應用。...

總上下文：

131K

最大輸出：

131K

輸入：

0.14

/ M Tokens

輸入：

text

/ M Tokens

輸出：

0.57

/ M Tokens

Tencent

Text Generation

Hy3

發行日期：2026年6月26日

Built for real-world business scenarios, Hy3 features a 295B/21B active MoE architecture, native 256K context support, and three reasoning modes. It enhances coding, long-form comprehension, multi-turn dialogue, and agentic task execution, balancing reliability, efficiency, and cost across both high-frequency interactions and complex workflows....

總上下文：

262K

最大輸出：

262K

輸入：

0.132

/ M Tokens

輸入：

text

/ M Tokens

輸出：

0.528

/ M Tokens

Tencent

Text Generation

Hy3-preview

發行日期：2026年4月7日

Hy3 preview is a 295B-parameter Mixture-of-Experts (MoE) language model from Tencent Hunyuan, built for production-grade agent workloads. With only 21B parameters activated per token and native 256K context support, it handles complex tasks like cross-file code refactoring, long-document analysis, and multi-step tool use, rather than just generating fluent dialogue. Hy3 scores near state-of-the-art on SWE-bench Verified and advanced STEM benchmarks, while offering three inference modes (no_think, think_low, think_high) to dynamically trade off latency and reasoning depth. Its sparse activation architecture delivers competitive intelligence at a significantly lower token cost....

總上下文：

262K

最大輸出：

262K

輸入：

0.066

/ M Tokens

輸入：

text

/ M Tokens

輸出：

0.26