🎉 gemma-4-31B-it可在 SiliconFlow 上使用。現在就試試看。

模型

產品

定價

文檔

部落格

關於

聯繫

最先進的

人工智能模型庫

一個 API 可以運行 200 多個尖端 AI 模型，並在幾秒鐘內部署

最先進的

人工智能模型庫

一個 API 可以運行 200 多個尖端 AI 模型，並在幾秒鐘內部署

最先進的

人工智能模型庫

一個 API 可以運行 200 多個尖端 AI 模型，並在幾秒鐘內部署

All

Featured

LLM

Vision

Image

Video

Audio

Serverless

DeepSeek

Text Generation

DeepSeek-V4-Pro

發行日期：2026年4月24日

DeepSeek-V4-Pro is DeepSeek's flagship open-source MoE model with 1.6T total parameters and 49B activated, purpose-built for frontier-level reasoning, coding, and agentic tasks. Supporting a 1M-token context window and three reasoning effort modes up to Think Max, it achieves top-tier performance on coding benchmarks such as LiveCodeBench and Codeforces — rivaling leading closed-source models — and is released under the MIT License....

總上下文：

1049K

最大輸出：

393K

輸入：

1.74

/ M Tokens

輸入：

text

/ M Tokens

輸出：

3.48

/ M Tokens

DeepSeek

Text Generation

DeepSeek-V4-Flash

發行日期：2026年4月24日

DeepSeek-V4-Flash is DeepSeek's latest open-source MoE model featuring 284B total parameters with only 13B activated during inference, delivering high-speed generation without sacrificing capability. With native support for a 1M-token context window and three switchable reasoning modes — Non-Think, Think High, and Think Max — it offers flexible intelligence scaling from everyday tasks to complex reasoning, all under the MIT License....

總上下文：

1049K

最大輸出：

393K

輸入：

0.14

/ M Tokens

輸入：

text

/ M Tokens

輸出：

0.28

/ M Tokens

DeepSeek

Text Generation

DeepSeek-V3.2

發行日期：2025年12月4日

DeepSeek-V3.2 是一個模型，能夠將高計算效率與卓越的推理和代理性能相結合。它的方法建立在三個關鍵技術突破之上：DeepSeek Sparse Attention (DSA)，這是一種有效的注意力機制，顯著降低了計算複雜性，同時保持模型性能，特別針對長上下文場景進行了優化；一個可擴展的強化學習框架，使其性能可與 GPT-5 比肩，推理能力則可與其高計算版本的 Gemini-3.0-Pro 並駕齊驅；以及一個大規模代理任務合成管道，用於在使用工具的場景中整合推理，提高在複雜交互環境中的合規性和泛化能力。該模型在 2025 年國際數學奧林匹克(IMO)和國際信息學奧林匹克(IOI)中獲得金牌成績。...

總上下文：

164K

最大輸出：

164K

輸入：

0.27

/ M Tokens

輸入：

text

/ M Tokens

輸出：

0.42