The Playground lets you experiment with Cerebras models directly in the browser. Use it to evaluate models, iterate on prompts, test tool/function calling, and tune parameters, then export a working request to code when you’re ready.
Your playground and API requests are never used to train models.
Message Roles
| Role | Purpose |
|---|
| System | Sets the behavior and context for the model. Use this to define a persona, provide background information, or constrain responses. |
| User | Represents input from the human turn. This is the prompt the model responds to. |
| Assistant | Represents a prior model response. Insert assistant messages to simulate a multi-turn conversation or prime the model toward a particular style or format. |
Use the Add button to add a new User or Assistant message to the conversation without running inference. Use it to build multi-turn conversations manually. Click Run to send the full conversation to the model.
After each response, the Playground displays token usage, inference time, speed (tokens per second), and round trip time in the upper-right corner of the response.
Configuration
Select a model from the dropdown at the top right. See the Models overview to learn more about the available options.
| Parameter | What it controls |
|---|
| Temperature | Randomness of the output. Lower values produce more focused, predictable responses; higher values produce more varied responses. |
| Max Completion Tokens | Maximum number of tokens the model will generate in a single response. |
| Top P | Nucleus sampling threshold. Limits the model to sampling from the top portion of the probability distribution. |
| Format | Output format. text (default), json_object, or json_schema. |
| Functions | Define tool/function schemas the model can call. See Tool Calling for the full reference. |
| Reasoning Effort | Controls how much effort the model spends reasoning before responding. Only available for select models. |
| Seed | Set an integer to produce deterministic outputs across requests with the same input. |
| Stream | Stream tokens as they’re generated rather than returning the full response at once. |
| Stop Sequence | A string that causes the model to stop generating when produced. Useful for structured outputs. |
View Code
Once you have a prompt and parameters you’re happy with, click View Code to get a ready-to-run code snippet for your application.