Z AI

GLM-4.6

GLM-4.6 is Z AI’s latest flagship model, designed to push agentic and coding performance further. It expands the context window from 128K to 200K tokens, improves reasoning and tool-use capabilities, and delivers stronger results in coding benchmarks and real-world development workflows. GLM-4.6 demonstrates refined writing quality, more capable agent behavior, and higher token efficiency (β‰ˆ15% fewer tokens vs. GLM-4.5). Evaluations show clear gains over GLM-4.5 across reasoning, agents, and coding, reaching near parity with Claude Sonnet 4 in practical tasks while outperforming other open-source baselines. GLM-4.6 is available through the Z.ai API platform, OpenRouter, coding agents (Claude Code, Roo Code, Cline, Kilo Code), and soon as downloadable weights on HuggingFace and ModelScope.

Advanced Reasoning

Pricing

$0.6
Input tokens per million
$2.20
Output tokens per million

Technical Specifications

Context Window
200K tokens
Max Output Tokens
128K tokens
Global Availability
Last Updated
N/A

Provider

Z AI
Location
πŸ‡ΈπŸ‡¬ Singapore
Visit Website β†’

Privacy & Data

Data Retention
No
Used for Training
No
Z AI Privacy Policy β†’
GLM-4.6 - AI Model Details | Requesty