🎉 LongCat-2.0 is now available on SiliconFlow. Try it now.

State-of-the-Art

AI Model Library

One API to run inference on more than 200 cutting-edge AI models and deploy in seconds

Z.ai

Z.ai

Text Generation

GLM-4.7

Released: Dec 23, 2025

GLM-4.7 is Zhipu's new flagship model, with 355 billion total parameters and 32 billion active parameters, delivering comprehensive upgrades in general conversation, reasoning, and agent capabilities. Responses are more concise and natural; writing feels more immersive; tool-calling instructions are followed more reliably; and the front-end polish of artifacts and agentic coding, along with the efficiency of long-horizon task completion, has been further improved....

Total Context: 205K

Max output: 205K

Input: 0.42 / M Tokens

Output: 2.2 / M Tokens
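Because each listing quotes separate input and output prices per million tokens, the cost of a single request can be estimated with simple arithmetic. The sketch below is illustrative only: the function name `estimate_cost` is hypothetical, the currency is whatever the catalog bills in (the listing does not state it), and it assumes plain linear per-token billing with no caching discounts or pricing tiers. The example call uses the GLM-4.7 prices listed above.

```python
def estimate_cost(
    input_tokens: int,
    output_tokens: int,
    input_price_per_m: float,
    output_price_per_m: float,
) -> float:
    """Estimate request cost from per-million-token prices.

    Assumes linear billing; the result is in the catalog's (unstated) currency.
    """
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost


# GLM-4.7 as listed: 0.42 per M input tokens, 2.2 per M output tokens.
cost = estimate_cost(
    input_tokens=200_000,
    output_tokens=50_000,
    input_price_per_m=0.42,
    output_price_per_m=2.2,
)
print(f"Estimated cost: {cost:.4f}")
```

The same function can be reused for any other entry in the list by substituting that model's two prices.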

Z.ai

Text Generation

GLM-4.5-Air

Released: Jul 28, 2025

The GLM-4.5 series models are foundation models designed for intelligent agents. GLM-4.5-Air adopts a more compact design with 106 billion total parameters and 12 billion active parameters. It is also a hybrid model, offering both a thinking mode and a non-thinking mode....

Total Context: 131K

Max output: 131K

Input: 0.14 / M Tokens

Output: 0.86 / M Tokens

Z.ai

Text Generation

GLM-5.2

Released: Jun 17, 2026

GLM-5.2 is Z.ai’s most capable open-source model to date, built for long-horizon agentic engineering with a truly usable 1M-token context window. It keeps project state intact across ultra-long tasks, reducing the need to compress or discard context—the longer the task, the more it can remember and reason....

Total Context: 1049K

Max output: 262K

Input: 1.302 / M Tokens

Output: 4.092 / M Tokens

Z.ai

Text Generation

GLM-5.1

Released: Apr 3, 2026

GLM-5.1 is Z.ai's next-generation flagship model built for agentic engineering. It is designed to run continuously for hours or even longer, refining its strategy as it works—the longer it runs, the better the results....

Total Context: 205K

Max output: 131K

Input: 1.19 / M Tokens

Output: 3.74 / M Tokens

Z.ai

Text Generation

GLM-5V-Turbo

Released: Mar 30, 2026

GLM-5V-Turbo is Zhipu’s latest flagship multimodal foundation model, optimized for multimodal coding and agent capabilities. It supports up to 200K tokens of image, video, and text context, and, when integrated with frameworks such as Claude Code and OpenClaw, can handle complex long-horizon programming and assistant tasks....

Total Context: 205K

Max output: 131K

Input: 1.2 / M Tokens

Output: 4.0 / M Tokens

Z.ai

Text Generation

GLM-5

Released: Feb 12, 2026

GLM-5 is a next-generation open-source model for complex systems engineering and long-horizon agentic tasks, scaled to ~744B sparse parameters (~40B active) with ~28.5T pretraining tokens. It integrates DeepSeek Sparse Attention (DSA) to retain long-context capacity while reducing inference cost, and leverages the “slime” asynchronous RL stack to deliver strong performance in reasoning, coding, and agentic benchmarks....

Total Context: 205K

Max output: 131K

Input: 0.95 / M Tokens

Output:

2.55