Whether you’re building agents, refactoring legacy code, or exploring new codebases, you can now work at the speed of thought using the latest open models like Qwen 3 32B and Qwen 3 235B — powered by Cerebras’ wafer-scale inference.

Set Up Cerebras in Cline

1

Get a free Cerebras API key

Get an API key here. Step 1
2

Install Cline

Head to cline.bot to install the fastest local AI dev environment. Step 2
3

Configure your Cerebras API key

  • Inside of your code editor, go to your Cline settings: Step 3
  • In your Cline settings, select Cerebras: Step 3-1
  • Paste in your API key: Step 3-2
4

Choose a Cerebras-backed model

In the model selector, pick Qwen-3-32B (Cerebras), Qwen-3-235B (Cerebras), or Llama3.3-70B (Cerebras) to tap into lightning-fast inference. Step 4
5

Start coding — but faster

Use Cline as usual: refactor, generate, build agents, or explore codebases — now with near-zero latency
6

Need more speed or scale?

Upgrade with Cerebras credits for 24/7 access to premium models like Qwen3-235B.