Context Length

Free Tier65k tokens
Paid Tiers128k tokens

Speed

~2100
tokens/sec

Input / Output

Input Formats JSON, plain text
Output FormatsJSON, plain text, structured

Pricing

Input
$0.85 / M tokens
Output
$1.20 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.

Model Notes

Model ID: llama3.3-70b
Available through Hugging Face and OpenRouter.

Rate Limits

TierRequests/minInput Tokens/minOutput Tokens/minDaily Tokens
Free3060k8k/request1M
1300300k30k41M
2600600k60k85M
310001M100k140M
412001.2M120k190M
514501.45M145k275M

Endpoints

Chat Completions
Completions

Features

Streaming
Structured Outputs
Tool Calling
Parallel Tool Calling

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.Contact Sales