Deploy model to endpoint
POST
This feature is in Private Preview. For access or more information, contact us or reach out to your account representative.
Authorizations
Management API key generated from the Management API keys section on the API keys page at https://cloud.cerebras.ai. Use the format: Bearer <MANAGEMENT_API_KEY>
Path Parameters
Unique identifier for the endpoint. It is used as the model field when making an inference request.
Example: my-org-gpt-oss-120b
curl --location 'https://api.cerebras.ai/v1/chat/completions' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer ${CEREBRAS_API_KEY}" \
--data '{
"model": "my-org-gpt-oss-120b",
"messages": [{"content": "Hello!", "role": "user"}]
}'Body
application/json
Model version name in the format of orgs/<org_name>/models/<model_arch_id>/versions/<version_id>, where <version_id> can be an integer version ID or a model version alias.

