Question 1

How many Z.ai models are available through Requesty?

Accepted Answer

Requesty routes to 6 Z.ai models, including regional variants, with pricing synced in real time with the upstream provider.

Question 2

What is the cheapest Z.ai model?

Accepted Answer

The cheapest Z.ai models (GLM-4.7, GLM-4.6, and GLM-4.5) cost $0.60 per million input tokens and $2.20 per million output tokens. See the pricing columns in the table below for full per-model rates.

Question 3

Does Requesty add markup on Z.ai pricing?

Accepted Answer

No. Requesty passes through exactly what Z.ai charges. You pay the same per-token rates as going direct, plus you get smart routing, caching, analytics, and one unified API for 600+ models.

Question 4

Is my data used to train Z.ai models?

Accepted Answer

Z.ai's terms state that API data is not used for training. See their privacy policy for the authoritative statement.

Question 5

Where are Z.ai models hosted?

Accepted Answer

Z.ai models are hosted in 🇸🇬 Singapore. Some models are also available in additional regions through AWS Bedrock, Azure, or Google Vertex AI. To see which, filter by region on the Z.ai rows in the models explorer.

Model	Context	Max Output	Input/1M tokens	Output/1M tokens	Capabilities	Coding
glm-5.2	1M	128K	$1.40	$4.40	👁🧠🔧⚡	69
glm-5.1	200K	128K	$1.40	$4.40	👁🧠🔧⚡	56
GLM-5	200K	128K	$1.00	$3.20	👁🧠🔧⚡	N/A
GLM-4.7	200K	128K	$0.60	$2.20	🧠🔧	45
GLM-4.6	200K	128K	$0.60	$2.20	🧠🔧	46
GLM-4.5	131K	98K	$0.60	$2.20	🧠🔧	N/A
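
As a rough illustration of how the per-token rates above turn into a bill, here is a minimal Python sketch. The rates are copied from the table and may change. The names PRICES, estimate_cost, and cheapest_models are hypothetical helpers made up for this example and are not part of any Requesty or Z.ai SDK. The sketch ignores caching discounts and any regional price differences.

```python
from decimal import Decimal

# USD per 1M tokens as (input rate, output rate), copied from the table above.
# These are a snapshot for illustration; live pricing may differ.
PRICES = {
    "glm-5.2": (Decimal("1.40"), Decimal("4.40")),
    "glm-5.1": (Decimal("1.40"), Decimal("4.40")),
    "GLM-5": (Decimal("1.00"), Decimal("3.20")),
    "GLM-4.7": (Decimal("0.60"), Decimal("2.20")),
    "GLM-4.6": (Decimal("0.60"), Decimal("2.20")),
    "GLM-4.5": (Decimal("0.60"), Decimal("2.20")),
}

MILLION = Decimal(1_000_000)


def estimate_cost(model, input_tokens, output_tokens):
    """Estimate the USD cost of one request at the listed per-token rates."""
    input_rate, output_rate = PRICES[model]
    return (input_rate * input_tokens + output_rate * output_tokens) / MILLION


def cheapest_models():
    """Return every model that shares the lowest input rate."""
    lowest = min(input_rate for input_rate, _ in PRICES.values())
    return [name for name, (input_rate, _) in PRICES.items() if input_rate == lowest]


if __name__ == "__main__":
    # A request with 10,000 input tokens and 2,000 output tokens on GLM-4.6.
    print(estimate_cost("GLM-4.6", 10_000, 2_000))
    print(cheapest_models())
```

Decimal is used instead of float so that small per-request amounts add up without rounding drift when summed over many requests.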

GLM API Pricing

All Z.ai models

About Z.ai on Requesty