Capabilities
Streaming Responses
Learn how to enable streaming responses in the Cerebras API.
The Cerebras API supports streaming responses, allowing messages to be sent back in chunks and displayed incrementally as they are generated. To enable this feature, set the stream
parameter to True
within the chat.completions.create
method. This will result in the API returning an iterable containing the chunks of the message.
Similarly, the same can be done in TypeScript by setting the stream
property to true
within the chat.completions.create
method.
1
Initial Setup
Begin by importing the Cerebras SDK and setting up the client.
2
Streaming Responses
Set the stream
parameter to True
within the chat.completions.create
method to enable streaming responses.
Was this page helpful?