Model ID: llama3.3-70b
Model Stats
SPEED: ~2100 tokens/sec
CONTEXT
Free Tier: 65k tokens
Paid Tiers: 128k tokens
MAX OUTPUT
Free Tier: 8k tokens
Paid Tiers: 65k tokens
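The context and output caps above can be checked before a request is sent. The following is a minimal sketch, not an official client: the names `LIMITS` and `fits_limits` are hypothetical, "k" is assumed to mean 1,000 tokens, and the prompt and the generated tokens are assumed to share one context window.

```python
# Limits copied from the stats above; "k" is assumed to mean 1,000 tokens.
LIMITS = {
    "free": {"context": 65_000, "max_output": 8_000},
    "paid": {"context": 128_000, "max_output": 65_000},
}


def fits_limits(tier: str, prompt_tokens: int, max_output_tokens: int) -> bool:
    """Return True if a request fits the tier's context and output caps.

    Assumption: prompt tokens and generated tokens count against the
    same context window.
    """
    limits = LIMITS[tier]
    if max_output_tokens > limits["max_output"]:
        return False
    return prompt_tokens + max_output_tokens <= limits["context"]
```

Under these assumptions, a 60k-token prompt with an 8k output budget would not fit on the free tier (68k exceeds 65k) but would fit on a paid tier.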
Pricing
Input: $0.85 / M tokens
Output: $1.20 / M tokens
Developer pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.
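As a rough illustration of the per-million-token rates, the sketch below estimates the cost of a workload. The function name `estimate_cost_usd` is hypothetical, and the calculation ignores volume discounts and any enterprise terms.

```python
# Developer rates from the pricing above, in USD per million tokens.
INPUT_USD_PER_M = 0.85
OUTPUT_USD_PER_M = 1.20


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of a workload at the listed developer rates."""
    total = input_tokens * INPUT_USD_PER_M + output_tokens * OUTPUT_USD_PER_M
    return total / 1_000_000
```

For example, 2M input tokens and 500k output tokens come to 2 × $0.85 + 0.5 × $1.20 = $2.30.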
Rate Limits
| Tier | Requests/min | Input Tokens/min | Daily Tokens |
|---|---|---|---|
| Free | 30 | 60k | 1M |
| Developer | 1k | 1M | N/A |
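A client can stay under the per-minute limits in the table by throttling itself. The following is a sketch of a sliding-window limiter under stated assumptions: the class `MinuteLimiter` is hypothetical, the page does not say how the service measures its window (fixed or sliding), and the daily token limit is not tracked here.

```python
from collections import deque


class MinuteLimiter:
    """Client-side sliding-window limiter for per-minute request and token caps."""

    def __init__(self, max_requests: int, max_input_tokens: int, window_s: float = 60.0):
        self.max_requests = max_requests
        self.max_input_tokens = max_input_tokens
        self.window_s = window_s
        self._events = deque()  # (timestamp, input_tokens) of accepted requests

    def try_acquire(self, now: float, input_tokens: int) -> bool:
        """Record the request and return True if it fits both caps, else False."""
        # Forget requests that have left the window.
        while self._events and now - self._events[0][0] >= self.window_s:
            self._events.popleft()
        if len(self._events) >= self.max_requests:
            return False
        used_tokens = sum(tokens for _, tokens in self._events)
        if used_tokens + input_tokens > self.max_input_tokens:
            return False
        self._events.append((now, input_tokens))
        return True


# Free tier values from the table: 30 requests/min, 60k input tokens/min
# ("k" assumed to mean 1,000).
free_tier = MinuteLimiter(max_requests=30, max_input_tokens=60_000)
```

In practice `now` would come from `time.monotonic()`; taking it as a parameter keeps the limiter easy to test. A request that is refused can be retried after a short sleep.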
Endpoints
Chat Completions
Completions
Features
Streaming
Structured Outputs
Tool Calling
Parallel Tool Calling

