GLM-4.5V
About GLM-4.5V
As a part of the GLM-V family of models, GLM-4.5V is based on ZhipuAI’s foundation model GLM-4.5-Air, achieving SOTA performance on tasks such as image, video, and document understanding, as well as GUI agent operations.
Discover how GLM-4.5V's advanced multimodal reasoning powers innovative solutions across diverse real-world applications.
Multimodal Content Intelligence
Extract deep insights from diverse visual and textual content, including images, videos, and complex documents, for comprehensive analysis and reporting.
Use Case Example:
"Automatically summarized key events and identified specific objects in a 30-minute manufacturing surveillance video, generating a timestamped report for quality control."
Intelligent GUI Automation
Empower AI agents to interact with web, desktop, and mobile interfaces, performing complex tasks through visual understanding and precise action.
Use Case Example:
"Developed an agent that navigates a legacy Java-based ERP system, extracts specific order details, and inputs them into a modern cloud-based logistics platform, reducing manual processing time by 60%."
Deep Document & Chart Analysis
Analyze intricate financial reports, scientific papers, and technical schematics, extracting structured data, identifying trends, and generating detailed summaries.
Use Case Example:
"Processed a 150-page pharmaceutical research paper, extracting key experimental results from embedded charts and tables, and summarizing drug efficacy and safety profiles for regulatory review."
Visual QA & Anomaly Detection
Automate quality control by visually inspecting products, manufacturing lines, or digital assets, identifying defects, inconsistencies, or deviations from standards.
Use Case Example:
"Monitored a food packaging line via high-resolution cameras, detecting mislabeled products and packaging defects in real-time, preventing faulty items from reaching consumers."
Metadata
Specification
State
Deprecated
Architecture
Calibrated
Yes
Mixture of Experts
Yes
Total Parameters
106B
Activated Parameters
12B
Reasoning
No
Precision
FP8
Context length
66K
Max Tokens
66K
Compare with Other Models
See how this model stacks up against others.

Z.ai
chat
GLM-5
Release on: Feb 12, 2026
Total Context:
205K
Max output:
131K
Input:
$
0.3
/ M Tokens
Output:
$
2.55
/ M Tokens

Z.ai
chat
GLM-4.7
Release on: Dec 23, 2025
Total Context:
205K
Max output:
205K
Input:
$
0.42
/ M Tokens
Output:
$
2.2
/ M Tokens

Z.ai
chat
GLM-4.6V
Release on: Dec 8, 2025
Total Context:
131K
Max output:
131K
Input:
$
0.3
/ M Tokens
Output:
$
0.9
/ M Tokens

Z.ai
chat
GLM-4.6
Release on: Oct 4, 2025
Total Context:
205K
Max output:
205K
Input:
$
0.39
/ M Tokens
Output:
$
1.9
/ M Tokens

Z.ai
chat
GLM-4.5-Air
Release on: Jul 28, 2025
Total Context:
131K
Max output:
131K
Input:
$
0.14
/ M Tokens
Output:
$
0.86
/ M Tokens

Z.ai
chat
GLM-4.5V
Release on: Aug 13, 2025
Total Context:
66K
Max output:
66K
Input:
$
0.14
/ M Tokens
Output:
$
0.86
/ M Tokens

Z.ai
chat
GLM-4.1V-9B-Thinking
Release on: Jul 4, 2025
Total Context:
66K
Max output:
66K
Input:
$
0.035
/ M Tokens
Output:
$
0.14
/ M Tokens

Z.ai
chat
GLM-Z1-32B-0414
Release on: Apr 18, 2025
Total Context:
131K
Max output:
131K
Input:
$
0.14
/ M Tokens
Output:
$
0.57
/ M Tokens

Z.ai
chat
GLM-4-32B-0414
Release on: Apr 18, 2025
Total Context:
33K
Max output:
33K
Input:
$
0.27
/ M Tokens
Output:
$
0.27
/ M Tokens
