We currently offer three pricing tiers, each suited to a different type of user. Contact us to discuss your requirements, and we will work with you to find a plan that fits.

Free

$0

  • Lower rate limits
  • Community support

Exploration

Pay per token (coming soon)

  • Up to 100 RPM
  • Community support

Enterprise

Custom pricing

  • Custom rate limits
  • Fine-tuned models
  • Custom SLAs
  • Dedicated support

Exploration Tier Pricing

Exploration tier pricing is not yet available. Follow this page for updates.
Model                          Speed            Input            Output
Llama 4 Scout                  ~2600 tokens/s   $0.65/M tokens   $0.85/M tokens
Llama 3.1 8B                   ~2200 tokens/s   $0.10/M tokens   $0.10/M tokens
Llama 3.3 70B                  ~2100 tokens/s   $0.85/M tokens   $1.20/M tokens
DeepSeek R1 Distill Llama 70B  ~1700 tokens/s   $2.20/M tokens   $2.50/M tokens
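
To make the per-token rates concrete, the following minimal sketch estimates the cost of a workload from the rates listed above. The rates are copied from the table; the function name `estimate_cost` and the example token counts are illustrative assumptions, and the figures may change because the exploration tier is not yet available.

```python
# Per-million-token rates in USD as (input, output), copied from the table above.
# These may change before the exploration tier launches.
RATES = {
    "Llama 4 Scout": (0.65, 0.85),
    "Llama 3.1 8B": (0.10, 0.10),
    "Llama 3.3 70B": (0.85, 1.20),
    "DeepSeek R1 Distill Llama 70B": (2.20, 2.50),
}


def estimate_cost(model, input_tokens, output_tokens):
    """Return the estimated cost in USD (hypothetical helper, not an official API)."""
    input_rate, output_rate = RATES[model]
    # Rates are quoted per one million tokens, so scale the total down accordingly.
    return (input_tokens * input_rate + output_tokens * output_rate) / 1_000_000


if __name__ == "__main__":
    # Example workload (assumed): 2M input tokens and 500k output tokens.
    cost = estimate_cost("Llama 3.3 70B", 2_000_000, 500_000)
    print(f"${cost:.2f}")  # prints "$2.30"
```

Input and output tokens are billed at different rates for most models, so workloads that generate long responses cost more per request than workloads that mostly read long prompts.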

Enterprise Tier Pricing

Our enterprise tier offers flat monthly pricing with flexible contract terms of 3, 6, or 12 months. Your monthly rate is based on your required token processing capacity, specifically the maximum number of input and output tokens you need to process per minute. Contact us for a trial package.
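
Because the monthly rate depends on peak input and output tokens per minute, it can help to estimate those two numbers before contacting us. The sketch below is one rough way to do so, assuming you know your peak request rate and average tokens per request; the function name and the example figures are hypothetical and are not part of any official sizing method.

```python
def peak_tokens_per_minute(requests_per_minute, avg_input_tokens, avg_output_tokens):
    """Return (input_tpm, output_tpm) for a given peak request rate.

    Hypothetical sizing helper: multiplies the peak request rate by the
    average number of tokens per request in each direction.
    """
    input_tpm = requests_per_minute * avg_input_tokens
    output_tpm = requests_per_minute * avg_output_tokens
    return input_tpm, output_tpm


if __name__ == "__main__":
    # Assumed workload: 300 requests/min, 1200 input and 400 output tokens each.
    input_tpm, output_tpm = peak_tokens_per_minute(300, 1200, 400)
    print(input_tpm, output_tpm)  # prints "360000 120000"
```

Averages understate bursts, so you may want to add headroom to these figures when describing your requirements.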

In addition to the models available in the free and exploration tiers, enterprise customers have access to:

  • Llama 3.1 405B
  • Mixtral 8x22B
  • Mixtral 8x7B