Skip to main content
Model ID: zai-glm-4.6
Model card

Model Stats

SPEED
~1000
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
131k tokens
Paid Tiers
131k tokens
MAX OUTPUT
Free Tier
40k tokens
Paid Tiers
40k tokens

Pricing

Input
$2.25 / M tokens
Output
$2.75 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.

Model Notes

Reasoning is enabled by default for this model. To disable it, see the reasoning guide.

Rate Limits

TierRequests/minInput Tokens/minDaily Tokens
Free10150k1M
Developer500500kN/A

Endpoints

Chat Completions

Capabilities

Reasoning
Streaming
Structured Outputs
Tool Calling
Parallel Tool Calling

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.