Context Length
Free Tier65k tokens
Paid Tiers128k tokens
Speed
~2100
tokens/sec
Input / Output
Input Formats JSON, plain text
Output FormatsJSON, plain text, structured
Pricing
Input
$0.85 / M tokens
Output
$1.20 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.
Model Notes
Model ID:
llama3.3-70b
Available through Hugging Face and OpenRouter.
Rate Limits
Tier | Requests/min | Input Tokens/min | Output Tokens/min | Daily Tokens |
---|---|---|---|---|
Free | 30 | 60k | 8k/request | 1M |
1 | 300 | 300k | 30k | 41M |
2 | 600 | 600k | 60k | 85M |
3 | 1000 | 1M | 100k | 140M |
4 | 1200 | 1.2M | 120k | 190M |
5 | 1450 | 1.45M | 145k | 275M |
Endpoints
Chat Completions
Completions
Features
Streaming
Structured Outputs
Tool Calling
Parallel Tool Calling