Context Length

Free Tier40k tokens
Paid TiersUp to 128k tokens

Speed

~1000
tokens/sec

Input / Output

Input Formats JSON, plain text
Output FormatsJSON, plain text, structured

Model Notes

Model ID: qwen-3-235b-a22b-instruct-2507
This model supports only non-thinking mode. It will not generate <think></think> tags.

Rate Limits

TierRequests/minInput Tokens/minOutput Tokens/minDaily Tokens
Free3060k8k/request1M

Endpoints

Chat Completions
Completions

Features

Streaming
Structured Outputs
Tool Calling

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.Contact Sales