Prerequisites
Before you begin, ensure you have:- Cerebras API Key - Get a free API key here.
- ElevenLabs API Key - Visit ElevenLabs and create an account. Navigate to your profile settings to generate an API key.
- Python 3.10 or higher - Required for running the integration code.
Configure ElevenLabs Integration
1
Install required dependencies
Install the necessary Python packages for both Cerebras Inference and ElevenLabs:The
openai package provides the client for Cerebras Inference (OpenAI-compatible), and elevenlabs is the official ElevenLabs SDK for voice synthesis.Audio playback requirement: To play audio files, you may need to install FFmpeg:
- macOS:
brew install ffmpeg - Windows: Download from ffmpeg.org or use
choco install ffmpeg - Linux:
sudo apt install ffmpeg(Ubuntu/Debian) orsudo yum install ffmpeg(CentOS/RHEL)
2
Configure environment variables
Create a Alternatively, you can set these as environment variables in your shell:
.env file in your project directory to securely store your API keys:3
Initialize the Cerebras client
Set up the Cerebras client using the OpenAI-compatible interface. The integration header helps us track and optimize this integration:
4
Create a basic text-to-speech pipeline
Now let’s create a complete pipeline that generates text with Cerebras and converts it to speech with ElevenLabs. This example demonstrates the power of combining Cerebras’s fast inference with ElevenLabs’s natural voice synthesis:
5
Build a conversational voice agent
For a more advanced use case, here’s how to build a multi-turn conversational agent that maintains context across multiple interactions:This voice agent maintains conversation context and provides natural, spoken responses using Cerebras’s fast inference and ElevenLabs’s voice synthesis.
6
Stream responses for lower latency
For even faster response times, you can stream the Cerebras output and generate speech in chunks. Streaming provides the lowest possible latency by starting audio playback as soon as content is ready:
Voice Selection
ElevenLabs offers a variety of pre-made voices. Here are some popular options:- Rachel (21m00Tcm4TlvDq8ikWAM) - Calm, professional female voice
- Adam (pNInz6obpgDQGcFmaJgB) - Deep, authoritative male voice
- Bella (EXAVITQu4vr4xnSDxMaL) - Soft, friendly female voice
- Antoni (ErXwobaYiN019PkySvjV) - Well-rounded male voice
Use Cases
The Cerebras + ElevenLabs integration is perfect for:- Voice Assistants - Build responsive AI assistants with natural conversation flow
- Content Creation - Generate and narrate articles, stories, or educational content
- Customer Service - Create automated voice support systems with human-like responses
- Accessibility Tools - Convert text content to speech for visually impaired users
- Interactive Experiences - Build voice-enabled games, tours, or educational apps
- Podcast Generation - Automatically create podcast episodes from text content
FAQ
Audio playback not working
Audio playback not working
If you’re having trouble playing audio:
- Ensure you have audio output devices properly configured
- Try saving the audio to a file instead of playing directly:
- Install additional audio libraries if needed:
pip install sounddevice soundfile
Rate limiting errors
Rate limiting errors
If you encounter rate limiting:
- Cerebras: Check your rate limits and consider upgrading your plan
- ElevenLabs: Monitor your character quota in the ElevenLabs dashboard. Free tier has monthly limits.
High latency in responses
High latency in responses
To reduce latency:
- Use streaming for both text generation and audio synthesis (see Step 6)
- Keep responses concise by setting lower
max_completion_tokensvalues - Use faster Cerebras models like
llama3.1-8bfor simpler tasks - Consider caching common responses
Next Steps
- Explore the ElevenLabs API documentation for advanced features like voice cloning and dubbing
- Try different Cerebras models like
qwen-3-32bfor specialized tasks - Experiment with streaming responses for even lower latency
- Learn about structured outputs to format responses for voice synthesis
- Check out the ElevenLabs Voice Library for more voice options
- Migrate to GLM4.6: Ready to upgrade? Follow our migration guide to start using our latest model

