GLM-Z1-32B-0414

GLM-Z1-32B-0414

THUDM/GLM-Z1-32B-0414

About GLM-Z1-32B-0414

GLM-Z1-32B-0414 is a reasoning model with deep thinking capabilities. This model was developed based on GLM-4-32B-0414 through cold start and extended reinforcement learning, as well as further training on tasks involving mathematics, code, and logic. Compared to the base model, GLM-Z1-32B-0414 significantly improves mathematical abilities and the capability to solve complex tasks. During the training process, the team also introduced general reinforcement learning based on pairwise ranking feedback, further enhancing the model's general capabilities. Despite having only 32B parameters, its performance on certain tasks is comparable to DeepSeek-R1 with 671B parameters. Through evaluations on benchmarks such as AIME 24/25, LiveCodeBench, and GPQA, the model demonstrates strong mathematical reasoning abilities and can support solutions for a wider range of complex tasks

Available Serverless

Run queries immediately, pay only for usage

$

0.14

/

$

0.57

Per 1M Tokens (input/output)

Metadata

Create on

Apr 18, 2025

License

mit

Provider

Z.ai

HuggingFace

Specification

State

Available

Architecture

Calibrated

No

Mixture of Experts

No

Total Parameters

32

Activated Parameters

32B

Reasoning

No

Precision

FP8

Context length

131K

Max Tokens

131K

Supported Functionality

Serverless

Supported

Serverless LoRA

Not supported

Fine-tuning

Not supported

Embeddings

Not supported

Rerankers

Not supported

Support image input

Not supported

JSON Mode

Supported

Structured Outputs

Not supported

Tools

Supported

Fim Completion

Not supported

Chat Prefix Completion

Not supported

Model FAQs: Usage, Deployment

Learn how to use, fine-tune, and deploy this model with ease.

Ready to accelerate your AI development?

Ready to accelerate your AI development?