Skip to main content
Get a free API key by visiting our API playground.

Get started

Build your first application using our QuickStart guide.
import os
from cerebras.cloud.sdk import Cerebras

client = Cerebras(
  api_key=os.environ.get("CEREBRAS_API_KEY"),
)

chat_completion = client.chat.completions.create(
  messages=[
  {"role": "user", "content": "Why is fast inference important?",}
],
  model="llama-3.3-70b",
)

Explore Models

View our available models, including performance specifications, rate limits, and pricing details.

Get Familiar