The Playground lets you experiment with Cerebras models directly in the browser. Use it to evaluate models, iterate on prompts, test tool/function calling, and tune parameters, then export a working request to code when you’re ready.Documentation Index
Fetch the complete documentation index at: https://inference-docs.cerebras.ai/llms.txt
Use this file to discover all available pages before exploring further.
Your playground and API requests are never used to train models.
Message Roles
| Role | Purpose |
|---|---|
| System | Sets the behavior and context for the model. Use this to define a persona, provide background information, or constrain responses. |
| User | Represents input from the human turn. This is the prompt the model responds to. |
| Assistant | Represents a prior model response. Insert assistant messages to simulate a multi-turn conversation or prime the model toward a particular style or format. |
Configuration
Select a model from the dropdown at the top right. See the Models overview to learn more about the available options.| Parameter | What it controls |
|---|---|
| Temperature | Randomness of the output. Lower values produce more focused, predictable responses; higher values produce more varied responses. |
| Max Completion Tokens | Maximum number of tokens the model will generate in a single response. |
| Top P | Nucleus sampling threshold. Limits the model to sampling from the top portion of the probability distribution. |
| Format | Output format. text (default), json_object, or json_schema. |
| Functions | Define tool/function schemas the model can call. See Tool Calling for the full reference. |
| Reasoning Effort | Controls how much effort the model spends reasoning before responding. Only available for select models. |
| Seed | Set an integer to produce deterministic outputs across requests with the same input. |
| Stream | Stream tokens as they’re generated rather than returning the full response at once. |
| Stop Sequence | A string that causes the model to stop generating when produced. Useful for structured outputs. |

