Skip to main content
This model has been deprecated as of October 15, 2025

Model Stats

SPEED
~2400
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
8k tokens
Paid Tiers
32k tokens
MAX OUTPUT
Free Tier
N/A
Paid Tiers
N/A

Pricing

Input
$0.20 / M tokens
Output
$0.60 / M tokens
Developer pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.

Model Notes

Model ID: llama-4-maverick-17b-128e-instruct
We recommend setting temperature=0.6, min_p=0.01, and top_p=0.9.

Rate Limits

TierRequests/minInput Tokens/minDaily Tokens
Free3060k1M

Endpoints

Chat Completions
Completions

Features

Streaming
Structured Outputs
Tool Calling

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.