Documentation Index
Fetch the complete documentation index at: https://inference-docs.cerebras.ai/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Cerebras Inference is available through AWS Marketplace, allowing you to access our ultra-fast inference models with billing consolidated through your AWS account. This integration is ideal for organizations that want to:- Consolidate AI infrastructure costs with existing AWS spending
- Leverage AWS Enterprise Discount Programs (EDP) and committed spend
- Simplify procurement and billing processes
- Access Cerebras models without separate vendor management
Prerequisites
Before you begin, ensure you have:- AWS Account - An active AWS account with permissions to subscribe to Marketplace offerings. Visit AWS Marketplace if you need to create an account.
- Cerebras Cloud Account - You’ll create or link this during the subscription process
Subscribe to Cerebras on AWS Marketplace
Find Cerebras in AWS Marketplace
- Log in to the AWS Marketplace
- Search for “Cerebras” in the marketplace search bar
- Select the Cerebras Fast Inference Cloud listing from the search results
Review and subscribe
- On the Cerebras listing page, select View purchase options
- Review the pricing details and terms carefully
- Click Subscribe to proceed with your subscription
Set up your Cerebras account
- Click Set up your account on the confirmation page
- You’ll be redirected to
https://cloud.cerebras.ai/dashboard - Either log in to your existing Cerebras account or create a new one
- Your Cerebras account will automatically be linked to your AWS Marketplace subscription
Generate your API key
- Navigate to the API Keys page
- Click Create API Key
- Give your key a descriptive name (e.g., “Production API Key” or “AWS Marketplace Key”)
- Copy and securely store your API key - you won’t be able to see it again
Using Cerebras Inference
Once you’ve completed the subscription and setup, you can start using Cerebras Inference with your API key. For detailed integration instructions, code examples, and API documentation, visit:X-Cerebras-3rd-Party-Integration: aws-marketplace to ensure your usage is properly tracked for AWS Marketplace billing.Available Models
Cerebras offers several state-of-the-art models through AWS Marketplace:| Model | Description | Best For |
|---|---|---|
| llama3.1-8b | Smaller, faster Llama model | Fastest option for simple tasks and high-throughput scenarios |
| gpt-oss-120b | Large open-source model | Largest model for the most demanding tasks |
| zai-glm-4.7 | Advanced 357B parameter model | Advanced 357B parameter model with strong reasoning capabilities |
Account and Billing Management
Viewing Usage
You can monitor your API usage in two places: Cerebras Dashboard Visit the Usage page to view detailed metrics:- Total requests and tokens processed
- Usage breakdown by model
- Usage breakdown by API key
- Real-time usage metrics and trends
- Historical usage data
- Go to your AWS Marketplace subscriptions
- Select your username in the upper-right corner
- Choose Your AWS Marketplace Software
- Select Manage Subscriptions
- Find Cerebras Inference in your active subscriptions
Understanding Billing
All billing is handled through your AWS account with transparent, usage-based pricing:- Billing Location: AWS Billing and Cost Management console
- Billing Frequency: Monthly, based on API usage
- Pricing Model: Per-million tokens for inputs and outputs (varies by model)
- AWS Integration: Charges appear as AWS Marketplace line items
- Payment Method: Uses your existing AWS payment method
- In the AWS Console, select your username in the upper-right corner
- Choose Billing and Cost Management
- Navigate to Bills or Cost Explorer
- Look for Cerebras charges under AWS Marketplace
Managing Your Subscription
To modify or cancel your subscription:- Navigate to the AWS Marketplace console
- Select Manage Subscriptions
- Find your Cerebras Inference subscription
- Choose the action you want to take (modify, cancel, etc.)
Next Steps
Now that you’re set up with Cerebras through AWS Marketplace, explore these resources to get the most out of your integration:API Reference
Model Documentation
Best Practices
GLM4.7 Migration Guide
FAQ
Do my AWS and Cerebras account emails need to match?
Do my AWS and Cerebras account emails need to match?
How long does it take for charges to appear in AWS billing?
How long does it take for charges to appear in AWS billing?
Can I use my AWS Enterprise Discount Program (EDP) credits?
Can I use my AWS Enterprise Discount Program (EDP) credits?
What happens if I cancel my AWS Marketplace subscription?
What happens if I cancel my AWS Marketplace subscription?
Can I have multiple API keys for the same AWS subscription?
Can I have multiple API keys for the same AWS subscription?
How do I increase my rate limits?
How do I increase my rate limits?
Troubleshooting
Authentication Errors
Authentication Errors
- Verify your API key is correct in your
.envfile - Ensure your AWS Marketplace subscription is active
- Check that your Cerebras account is properly linked to AWS
- Confirm you’re using the correct base URL:
https://api.cerebras.ai/v1 - Try generating a new API key from the Cerebras dashboard
- Verify the
X-Cerebras-3rd-Party-Integrationheader is set toaws-marketplace
Billing Questions
Billing Questions
- Charges appear monthly, not in real-time
- Check the AWS Marketplace section of your billing console
- Allow 24-48 hours for charges to appear after usage
- Verify your subscription is active in AWS Marketplace
- Contact AWS Support for billing-specific questions
Account Linking Issues
Account Linking Issues
- Clear your browser cache and cookies
- Try the setup process in an incognito/private browser window
- Ensure you’re logged into the correct AWS account
- Verify you completed the subscription process in AWS Marketplace
- Try logging out of both AWS and Cerebras, then start fresh
Rate Limiting
Rate Limiting
- Review our rate limits documentation
- Implement exponential backoff in your retry logic
- Consider distributing requests across multiple API keys
- Monitor your usage patterns in the Cerebras dashboard
- Contact sales to discuss higher rate limits for production workloads
Model Not Found Errors
Model Not Found Errors
- Ensure you’re using the correct model name format:
model-name - Verify the model is available in your region
- Check the models documentation for current model availability
- Some models may have limited availability - contact support for details
Support
For technical support and questions: Cerebras Technical Issues- Email: support@cerebras.ai
- Documentation: inference-docs.cerebras.ai
- Use AWS Support through your AWS Console
- Visit the AWS Marketplace Help Center
Ready to experience the fastest inference in the world? Start building with Cerebras through AWS Marketplace today and see the difference that purpose-built AI hardware makes.

