Skip to main content
GET
/
management
/
v1
/
endpoints
/
{endpoint_id}
Retrieve endpoint status
curl --request GET \
  --url https://api.cerebras.ai/management/v1/endpoints/{endpoint_id} \
  --header 'Authorization: Bearer <token>'
{
  "name": "my-org-llama-3.3-70b",
  "model_arch_id": "llama-3.3-70b",
  "deployed_models": [
    {
      "id": "550e8400-e29b-41d4-a716-446655440000",
      "model": "orgs/my-org/models/llama-3.3-70b/versions/1",
      "version_alias": "production-jan-12-2026-finetuned",
      "created": 1736700000,
      "state": "complete"
    }
  ],
  "managing_org_name": "my-org",
  "created": 1736600000,
  "updated": 1736700000
}
This feature is in Private Preview. For access or more information, contact us or reach out to your account representative.
Describes the status of an endpoint, including deployment status and active model versions.

Authorizations

Authorization
string
header
required

Management API key generated from the Management API keys section on the API keys page at https://cloud.cerebras.ai. Use the format: Bearer <MANAGEMENT_API_KEY>

Path Parameters

endpoint_id
string
required

Unique identifier for the endpoint. It is used as the model field when making an inference request.

Example: my-org-llama-3.3-70b

curl --location 'https://api.cerebras.ai/v1/chat/completions' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer ${CEREBRAS_API_KEY}" \
--data '{
"model": "my-org-llama-3.3-70b",
"messages": [{"content": "Hello!", "role": "user"}]
}'

Response

200 - application/json

Successful Response

name
string
required

Endpoint name (e.g. my-org-llama-3.3-70b).

model_arch_id
string
required

Name of the model architecture (e.g. llama3.1-8b, llama-3.3-70b).

deployed_models
DeployedModel · object[]
required

List of deployed models.

created
integer
required

Unix timestamp (in seconds) when the endpoint was created.

updated
integer
required

Unix timestamp (in seconds) when the endpoint was last updated.

managing_org_name
string | null

Organization that has management access to this endpoint.