Skip to main content
This model will be deprecated on November 3, 2025
Model ID: llama-4-scout-17b-16e-instruct

Model Stats

SPEED
~2600
tokens/sec
INPUT / OUTPUT
/
CONTEXT
Free Tier
8k tokens
Paid Tiers
32k tokens
MAX OUTPUT
Free Tier
8k tokens
Paid Tiers
32k tokens

Pricing

Input
$0.65 / M tokens
Output
$0.85 / M tokens
Developer pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.

Rate Limits

TierRequests/minInput Tokens/minDaily Tokens
Free3060k1M

Endpoints

Chat Completions
Completions

Capabilities

Streaming
Structured Outputs
Tool Calling
Tool Calling w/ Structured Outputs

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.
I