The Cerebras Code MCP server is currently in research preview and is open source here. We welcome contributions!
1
Set up your API key
You need a valid Cerebras API key. Please visit this link and sign up, then click on API Keys in the left navigation.
Optionally, create an OpenRouter key here to use as fallback if you hit Cerebras rate limits.
2
Install the NPM Package
Open your preferred IDE and run the following code in your terminal:
3
Run the Setup Wizard
In your terminal, run:This will begin the setup process where you can configure different editors (like Claude Code) and set API keys for Cerebras and OpenRouter.
Claude Code
Enable MCP usage with Claude Code
Claude Code
Enable MCP usage with Claude Code
Run the following to verify the setup:The output should look like this:
Available Models
The Cerebras Code MCP Server supports all Cerebras models:| Model | Parameters | Best For |
|---|---|---|
| llama-3.3-70b | 70B | Best for complex reasoning, long-form content, and tasks requiring deep understanding |
| qwen-3-32b | 32B | Balanced performance for general-purpose applications |
| llama3.1-8b | 8B | Fastest option for simple tasks and high-throughput scenarios |
| gpt-oss-120b | 120B | Largest model for the most demanding tasks |
| zai-glm-4.7 | 357B | Advanced 357B parameter model with strong reasoning capabilities |

