🎉 Kimi-K3 is available on SiliconFlow. Try it now.


State-of-the-art AI model library

One API to run more than 200 cutting-edge AI models, deployed in seconds



Z.ai


Text Generation

GLM-4.7

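Release date: December 23, 2025

GLM-4.7 is Zhipu's new-generation flagship model, with 355B total parameters and 32B active parameters, and comprehensive upgrades to general conversation, reasoning, and agentic capabilities. Responses are more concise and natural; writing feels more immersive; tool-calling instructions are followed more reliably; and front-end polish for Artifacts and agentic coding, along with efficiency on long-horizon tasks, has improved further....

Total context: 205K
Max output: 205K
Input: 0.42 / M Tokens
Input type: text
Output: 2.2 / M Tokens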

Z.ai

Text Generation

GLM-4.5-Air

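Release date: July 28, 2025

The GLM-4.5 series models are foundation models for intelligent agents. GLM-4.5-Air uses a more compact design, with 106 billion total parameters and 12 billion active parameters. It is also a hybrid reasoning model that offers both a thinking mode and a non-thinking mode....

Total context: 131K
Max output: 131K
Input: 0.14 / M Tokens
Input type: text
Output: 0.86 / M Tokens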

Z.ai

Text Generation

GLM-5.2

Release date: June 17, 2026

GLM-5.2 is Z.ai’s most capable open-source model to date, built for long-horizon agentic engineering with a truly usable 1M-token context window. It keeps project state intact across ultra-long tasks, reducing the need to compress or discard context—the longer the task, the more it can remember and reason....

Total context: 1049K
Max output: 262K
Input: 1.302 / M Tokens
Input type: text
Output: 4.092 / M Tokens

Z.ai

Text Generation

GLM-5.1

Release date: April 3, 2026

GLM-5.1 is Z.ai's next-generation flagship model built for agentic engineering. It is designed to run continuously for hours or even longer, refining its strategy as it works—the longer it runs, the better the results....

Total context: 205K
Max output: 131K
Input: 1.19 / M Tokens
Input type: text
Output: 3.74 / M Tokens

Z.ai

Text Generation

GLM-5V-Turbo

Release date: March 30, 2026

GLM-5V-Turbo is Zhipu’s latest flagship multimodal foundation model, optimized for multimodal coding and agent capabilities. It supports up to 200K tokens of image, video, and text context, and, when integrated with frameworks such as Claude Code and OpenClaw, can handle complex long-horizon programming and assistant tasks....

Total context: 205K
Max output: 131K
Input: 1.2 / M Tokens
Input type: text
Output: 4.0 / M Tokens
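
The per-million-token rates on the cards above can be turned into a rough cost estimate for a single request. The following is a minimal sketch, not an official pricing calculator: the names `RATES_PER_M_TOKENS` and `estimate_cost` are hypothetical, the page does not state a currency (so results are in the listed price units), and it assumes simple linear billing with no discounts, caching, or minimum charges.

```python
# Rates copied from the model cards on this page: (input, output) per 1M tokens.
# Assumption: the page shows no currency symbol, so values are in "listed units".
RATES_PER_M_TOKENS = {
    "GLM-4.7": (0.42, 2.2),
    "GLM-4.5-Air": (0.14, 0.86),
    "GLM-5.2": (1.302, 4.092),
    "GLM-5.1": (1.19, 3.74),
    "GLM-5V-Turbo": (1.2, 4.0),
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request, assuming linear per-token billing."""
    input_rate, output_rate = RATES_PER_M_TOKENS[model]
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


if __name__ == "__main__":
    # Compare the same hypothetical workload across the listed models.
    for name in RATES_PER_M_TOKENS:
        cost = estimate_cost(name, input_tokens=10_000, output_tokens=2_000)
        print(f"{name}: {cost:.5f}")
```

Because output tokens are priced several times higher than input tokens on every card, output-heavy workloads dominate the estimate even when prompts are long.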

Z.ai

Text Generation

GLM-5

Release date: February 12, 2026

GLM-5 is a next-generation open-source model for complex systems engineering and long-horizon agentic tasks, scaled to ~744B sparse parameters (~40B active) with ~28.5T pretraining tokens. It integrates DeepSeek Sparse Attention (DSA) to retain long-context capacity while reducing inference cost, and leverages the “slime” asynchronous RL stack to deliver strong performance in reasoning, coding, and agentic benchmarks....

Total context: 205K
Max output: 131K
Input: 0.95 / M Tokens
Input type: text
Output:

2.55