This model will be deprecated on November 3, 2025
Model ID:
llama-4-scout-17b-16e-instruct
Model Stats
SPEED
~2600
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
8k tokens
Paid Tiers
32k tokens
MAX OUTPUT
Free Tier
8k tokens
Paid Tiers
32k tokens
Pricing
Input
$0.65 / M tokens
Output
$0.85 / M tokens
Developer pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.
Rate Limits
Tier | Requests/min | Input Tokens/min | Daily Tokens |
---|---|---|---|
Free | 30 | 60k | 1M |
Endpoints
Chat Completions
Completions
Capabilities
Streaming
Structured Outputs
Tool Calling
Tool Calling w/ Structured Outputs