Models

Products

Pricing

Docs

Blog

About

Contact

🎉 LongCat-2.0 is available on SiliconFlow. Try it NOW.

🎉 LongCat-2.0 is available on SiliconFlow. Try it NOW.

Models

GLM-Z1-Rumination-32B-0414

GLM-Z1-Rumination-32B-0414

API Reference

About GLM-Z1-Rumination-32B-0414

GLM-Z1-Rumination-32B-0414 is a deep reasoning model with rumination capabilities (benchmarked against OpenAI's Deep Research). Unlike typical deep thinking models, the rumination model employs longer periods of deep thought to solve more open-ended and complex problems (e.g., writing a comparative analysis of AI development in two cities and their future development plans). The rumination model integrates search tools during its deep thinking process to handle complex tasks and is trained by utilizing multiple rule-based rewards to guide and extend end-to-end reinforcement learning. Z1-Rumination shows significant improvements in research-style writing and complex retrieval tasks. The model supports a complete research cycle of “independently raising questions—searching for information—building analysis—completing tasks” and includes function calls like search, click, open, and finish by default, enabling it to better handle complex problems that require external information

Use Case

Metadata

Create on

Apr 18, 2025

License

MIT

Provider

Z.ai

HuggingFace

GLM-Z1-Rumination-32B-0414

Specification

State

Deprecated

Architecture

Calibrated

No

Mixture of Experts

No

Total Parameters

32B

Activated Parameters

Reasoning

No

Precision

FP8

Context length

33K

Max Tokens

Compare with Other Models

See how this model stacks up against others.

Z.ai

GLM-4.7

Release on: Dec 23, 2025

Total Context:

205K

Max output:

205K

Input:

$

0.42

/ M Tokens

Output:

$

2.2

/ M Tokens

Z.ai

chat

GLM-4.6V

Release on: Dec 8, 2025

Total Context:

131K

Max output:

131K

Input:

$

0.3

/ M Tokens

Output:

$

0.9

/ M Tokens

Z.ai

chat

GLM-4.6

Release on: Oct 4, 2025

Total Context:

205K

Max output:

205K

Input:

$

0.39

/ M Tokens

Output:

$

1.9

/ M Tokens

Z.ai

chat

GLM-4.5-Air

Release on: Jul 28, 2025

Total Context:

131K

Max output:

131K

Input:

$

0.14

/ M Tokens

Output:

$

0.86

/ M Tokens

Z.ai

chat

GLM-4.5V

Release on: Aug 13, 2025

Total Context:

66K

Max output:

66K

Input:

$

0.14

/ M Tokens

Output:

$

0.86

/ M Tokens

Z.ai

chat

GLM-4.1V-9B-Thinking

Release on: Jul 4, 2025

Total Context:

66K

Max output:

66K

Input:

$

0.035

/ M Tokens

Output:

$

0.14

/ M Tokens

Z.ai

chat

GLM-Z1-32B-0414

Release on: Apr 18, 2025

Total Context:

131K

Max output:

131K

Input:

$

0.14

/ M Tokens

Output:

$

0.57

/ M Tokens

Z.ai

chat

GLM-4-32B-0414

Release on: Apr 18, 2025

Total Context:

33K

Max output:

33K

Input:

$

0.27

/ M Tokens

Output:

$

0.27

/ M Tokens

Z.ai

chat

GLM-Z1-9B-0414

Release on: Apr 18, 2025

Total Context:

131K

Max output:

131K

Input:

$

0.086

/ M Tokens

Output:

$

0.086

/ M Tokens

Ready to accelerate your AI development?

Ready to accelerate your AI development?

Ready to accelerate your AI development?

PAGES

MODELS

PRODUCTS

© 2026 SiliconFlow

·

PAGES

MODELS

PRODUCTS

© 2026 SiliconFlow

·

PAGES

MODELS

PRODUCTS

© 2026 SiliconFlow

·