Z AI

Z AI (formerly Zhipu AI) provides advanced large language models with strong agentic and coding capabilities.

📍 🇸🇬 Singapore • 2 models available
Available Models: 2
Avg Input Price/M: $0.60
Cheapest Model: zai/GLM-4.6 ($0.60)
Most Expensive: zai/GLM-4.6 ($0.60)

Features Overview

Vision Support: 0
Advanced Reasoning: 2
Caching Support: 0
Computer Use: 0

Privacy & Data Policy

Data Retention

No data retention

Location

🇸🇬 Singapore

Reasoning
Context Window: 200K tokens
Max Output: 128K tokens
Input: $0.60/M tokens
Output: $2.20/M tokens

GLM-4.6 is Z AI's latest flagship model, designed to push agentic and coding performance further. It expands the context window from 128K to 200K tokens, improves reasoning and tool-use capabilities, and delivers stronger results in coding benchmarks and real-world development workflows. GLM-4.6 demonstrates refined writing quality, more capable agent behavior, and higher token efficiency (≈15% fewer tokens vs. GLM-4.5). Evaluations show clear gains over GLM-4.5 across reasoning, agents, and coding, reaching near parity with Claude Sonnet 4 in practical tasks while outperforming other open-source baselines. GLM-4.6 is available through the Z.ai API platform, OpenRouter, coding agents (Claude Code, Roo Code, Cline, Kilo Code), and soon as downloadable weights on HuggingFace and ModelScope.
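
As a rough illustration of what the listed per-token prices mean in practice, the sketch below estimates the cost of a single request. The two rates ($0.60 per million input tokens, $2.20 per million output tokens) are taken from the listing; the function name `estimate_cost` and the example token counts are hypothetical, and actual billing may differ (for example through rounding or routing fees).

```python
# Prices copied from the listing (USD per one million tokens).
INPUT_PRICE_PER_M = 0.60
OUTPUT_PRICE_PER_M = 2.20


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical request: 50,000 prompt tokens and 4,000 completion tokens.
    print(f"${estimate_cost(50_000, 4_000):.4f}")  # prints $0.0388
```

Because output tokens cost more than three times as much as input tokens here, long completions tend to dominate the bill for generation-heavy workloads.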

Reasoning
Context Window: 131K tokens
Max Output: 98K tokens
Input: $0.60/M tokens
Output: $2.20/M tokens
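
The context window and maximum output figures interact: a long prompt leaves less room for the completion. The sketch below shows one way to compute the usable output budget. It assumes the context window is shared between prompt and completion, and it reads "131K" and "98K" as 131,000 and 98,000 tokens; both are assumptions, and the exact limits may differ. The function name `max_output_budget` is hypothetical.

```python
# Limits read off the listing; the exact token counts are an assumption.
CONTEXT_WINDOW = 131_000  # "131K tokens"
MAX_OUTPUT = 98_000       # "98K tokens"


def max_output_budget(prompt_tokens: int) -> int:
    """Largest completion size that still fits alongside the prompt.

    Assumes prompt and completion share one context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Never exceed the model's output cap, and never go below zero.
    return max(0, min(MAX_OUTPUT, remaining))


if __name__ == "__main__":
    for prompt in (10_000, 100_000, 131_000):
        print(prompt, max_output_budget(prompt))
```

Under these assumptions, requesting the full 98K output leaves about 33K tokens for the prompt.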

GLM-4.5 and GLM-4.5-Air are the Z AI flagship models that preceded GLM-4.6, purpose-built as foundation models for agent-oriented applications. Both use a Mixture-of-Experts (MoE) architecture. GLM-4.5 has 355B total parameters with 32B active per forward pass, while the more compact GLM-4.5-Air has 106B total parameters with 12B active.
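
To make the Mixture-of-Experts figures concrete, the sketch below computes the share of parameters that is active per forward pass for each model. The parameter counts come from the description above; the names `MODELS` and `active_fraction` are illustrative only.

```python
# (total parameters, active parameters) in billions, from the description.
MODELS = {
    "GLM-4.5": (355, 32),
    "GLM-4.5-Air": (106, 12),
}


def active_fraction(total_b: float, active_b: float) -> float:
    """Fraction of all parameters used in a single forward pass."""
    if total_b <= 0 or not 0 <= active_b <= total_b:
        raise ValueError("expected 0 <= active_b <= total_b and total_b > 0")
    return active_b / total_b


if __name__ == "__main__":
    for name, (total_b, active_b) in MODELS.items():
        print(f"{name}: {active_fraction(total_b, active_b):.1%} active")
    # prints:
    # GLM-4.5: 9.0% active
    # GLM-4.5-Air: 11.3% active
```

In both models only about a tenth of the weights participate in each forward pass, which is the usual reason an MoE model can be cheaper to run per token than a dense model of the same total size.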

Ready to use Z AI models?

Access all Z AI models through Requesty's unified API with intelligent routing, caching, and cost optimization.
