The Cerebras Inference API offers developers a low-latency solution for AI model inference, powered by Cerebras Wafer-Scale Engines and CS-3 systems. We invite developers to explore the new possibilities that our high-speed inference solution unlocks.

The Cerebras Inference API currently provides access to models from Meta’s Llama family, including Llama 4 Scout and Llama 3.3 70B, as well as DeepSeek R1 Distill Llama 70B (available upon request).

| Model Name | Model ID | Parameters | Knowledge Cutoff |
| --- | --- | --- | --- |
| Llama 4 Scout | llama-4-scout-17b-16e-instruct | 109 billion | August 2024 |
| Llama 4 Maverick | Coming soon | 400 billion | August 2024 |
| Llama 3.1 8B | llama3.1-8b | 8 billion | March 2023 |
| Llama 3.3 70B | llama-3.3-70b | 70 billion | December 2023 |
| DeepSeek R1 Distill Llama 70B* | deepseek-r1-distill-llama-70b | 70 billion | December 2023 |
* DeepSeek R1 Distill Llama 70B is available in private preview. Please contact us to request access.
Our free tier supports a context length of 8,192 tokens. For all supported models, we also offer context lengths of up to 128K tokens upon request. To gain access, please contact us.
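To check which model IDs are available to your API key (for example, whether a private-preview request for DeepSeek R1 Distill Llama 70B has been granted), you can query the models endpoint. The following is a minimal sketch, assuming the SDK's OpenAI-compatible client.models.list() call and a CEREBRAS_API_KEY environment variable:

import os
from cerebras.cloud.sdk import Cerebras

# Assumes CEREBRAS_API_KEY is set in your environment.
client = Cerebras(api_key=os.environ.get("CEREBRAS_API_KEY"))

# List the models reachable with this API key; the IDs printed here
# correspond to the "Model ID" column in the table above.
# (client.models.list() is assumed to follow the OpenAI-compatible shape.)
for model in client.models.list().data:
    print(model.id)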

QuickStart Guide

Get started by building your first application using our QuickStart guide. The snippet below reads your API key from the CEREBRAS_API_KEY environment variable and sends a single chat completion request.

import os

from cerebras.cloud.sdk import Cerebras

# The client reads your API key from the CEREBRAS_API_KEY environment variable.
client = Cerebras(
    api_key=os.environ.get("CEREBRAS_API_KEY"),
)

# Send a chat completion request to Llama 4 Scout.
chat_completion = client.chat.completions.create(
    messages=[
        {"role": "user", "content": "Why is fast inference important?"},
    ],
    model="llama-4-scout-17b-16e-instruct",
)

# Print the assistant's reply.
print(chat_completion.choices[0].message.content)
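For latency-sensitive applications, you will usually want to display tokens as they arrive rather than waiting for the full reply. The sketch below reuses the client from the QuickStart and assumes the SDK supports the OpenAI-style stream=True flag and delta chunks:

# Stream the reply token by token instead of waiting for the full response.
stream = client.chat.completions.create(
    messages=[{"role": "user", "content": "Why is fast inference important?"}],
    model="llama-4-scout-17b-16e-instruct",
    stream=True,  # assumed OpenAI-compatible streaming flag
)

for chunk in stream:
    # Each chunk carries an incremental piece of the assistant's reply.
    print(chunk.choices[0].delta.content or "", end="")
print()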
