This model has been deprecated as of October 15, 2025
Model Stats
SPEED
~2400
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
8k tokens
Paid Tiers
32k tokens
MAX OUTPUT
Free Tier
N/A
Paid Tiers
N/A
Pricing
Input
$0.20 / M tokens
Output
$0.60 / M tokens
Developer pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.
Model Notes
Model ID:
llama-4-maverick-17b-128e-instruct We recommend setting
temperature=0.6, min_p=0.01, and top_p=0.9.Rate Limits
| Tier | Requests/min | Input Tokens/min | Daily Tokens |
|---|---|---|---|
| Free | 30 | 60k | 1M |
Endpoints
Chat CompletionsCompletionsFeatures
Streaming
Structured Outputs
Tool Calling

