
GLM API Pricing
GLM models from Z.ai. Z AI (formerly Zhipu AI) provides advanced large language models with strong agentic and coding capabilities. Requesty routes to 5 Z.ai models starting at $0.60 per 1M input tokens with context windows up to 200K tokens. One API key, OpenAI-compatible SDK, no markup.
Flagship model
GLM-5.1Intelligence Index
51.4
Coding Index
43.4
GPQA Diamond
86.8%
Terminal-Bench Hard
43.2%
All Z.ai models
About Z.ai on Requesty
How many Z.ai models are available through Requesty?
Requesty routes to 5 Z.ai models including regional variants, with pricing synced in real time to the upstream provider.
What is the cheapest Z.ai model?
The cheapest Z.ai model starts at $0.60 per million input tokens. See the pricing column in the table below for full per-model rates.
Does Requesty add markup on Z.ai pricing?
No. Requesty passes through exactly what Z.ai charges. You pay the same per-token rates as going direct, plus you get smart routing, caching, analytics, and one unified API for 400+ models.
Is my data used to train Z.ai models?
Z.ai's terms state that API data is not used for training. See their privacy policy for the authoritative statement.
Where are Z.ai models hosted?
Z.ai models are hosted in πΈπ¬ Singapore. Some models are available in additional regions through AWS Bedrock, Azure, or Google Vertex AI: filter by region on the Z.ai rows in the models explorer.
