X-Cerebras-Version-Patch header and becomes the default on July 21, 2026.
Review what changed, then set
X-Cerebras-Version-Patch: 2 to test your integration before the July 21, 2026 cutover.What’s Guaranteed
Within a given API version, we guarantee:- Existing request parameters continue to work as documented
- Existing response fields remain present with the same types
- New optional request parameters may be added
- New fields may be added to responses
- Error message text may change (but not error types within a version)
Setting the API Version
Override the default API version per-request by passing theX-Cerebras-Version-Patch header:
Version 2 Rollout Timeline
| Date | What Happens |
|---|---|
| January 21, 2026 | Version 2 available for testing via header |
| July 21, 2026 | Version 2 becomes the default; older versions end-of-life |
What Changed in Version 2
Version 2 introduces stricter validation for structured outputs and tool calling, refines reasoning model behavior, and fixes edge cases. If your application uses JSON schemas, tool calls, or reasoning models, review these changes carefully.Structured Outputs: Stricter Schema Validation
Version 2 requires explicit, strictly-typed schemas. The following patterns will return errors:Schemas must use
$defs for reusable definitions. The definitions keyword is not supported.- Optional fields
- additionalProperties
- Enum types
Fields omitted from
required may be absent from the response instead of returning null. Update your code to handle missing fields rather than expecting null values.Tool Calling: Stricter Message Validation
Version 2 validates the structure of multi-turn conversations involving tool calls:| Validation | What it checks |
|---|---|
| Tool response completeness | Every tool_call_id in an assistant message must have a corresponding tool message immediately following |
| Orphan tool messages | Tool messages cannot reference a tool_call_id that doesn’t exist in a prior assistant message |
| Tool choice consistency | tool_choice cannot be set unless tools is also provided |
| Unique tool call IDs | Duplicate tool_call.id values in the same request are rejected |
Reasoning Format Default Changed
For reasoning models, the defaultreasoning_format is now parsed, which returns reasoning in a separate reasoning field and logprobs in a separate reasoning_logprobs field. If your application parses reasoning from the content field, you’ll need to update it to read from the reasoning field instead, or explicitly set reasoning_format: "raw" to preserve the previous behavior.
See Reasoning for details on reasoning format options.
Unicode Handling Fix
Logprobs now correctly handle the Unicode replacement character (\uFFFD). Previously, requests returning this character could show a mismatch between logprobs tokens and the actual content. This fix is backward-compatible and requires no code changes.
Migration Checklist
Use this checklist to prepare for version 2:1
Test with the version header
Add
X-Cerebras-Version-Patch: 2 to your requests and run your test suite.2
Update JSON schemas
- Add all properties to the
requiredarray - Add
additionalProperties: falseto all object definitions - Ensure enum values match their property types
3
Update reasoning parsing (if using reasoning models)
If you parse reasoning content, update your code to read from the
reasoning field in the response instead of extracting it from content.4
You're done
Once your tests pass, no further action is required. On July 21, 2026, version 2 takes effect and the header is no longer needed.
Future Versions
We strive to maintain backward compatibility whenever possible. In rare cases where breaking changes are necessary, a new API version will be released and made available for testing via theX-Cerebras-Version-Patch header for a minimum of 6 months before taking effect. Cerebras Cloud users will be notified via email before each version transition.
