List endpoints

curl --request GET \
  --url https://api.cerebras.ai/management/v1/orgs/{org_name}/endpoints \
  --header 'Authorization: Bearer <token>'

{
  "endpoints": [
    {
      "endpoint_id": "my-org-gpt-oss-120b",
      "model_arch_id": "gpt-oss-120b",
      "created": 1736700000,
      "updated": 1736700000,
      "org_name": "my-org"
    },
    {
      "endpoint_id": "my-org-llama3.1-8b",
      "model_arch_id": "llama3.1-8b",
      "created": 1736600000,
      "updated": 1736650000,
      "org_name": "my-org"
    }
  ]
}

GET

management

orgs

{org_name}

endpoints

List endpoints

curl --request GET \
  --url https://api.cerebras.ai/management/v1/orgs/{org_name}/endpoints \
  --header 'Authorization: Bearer <token>'

{
  "endpoints": [
    {
      "endpoint_id": "my-org-gpt-oss-120b",
      "model_arch_id": "gpt-oss-120b",
      "created": 1736700000,
      "updated": 1736700000,
      "org_name": "my-org"
    },
    {
      "endpoint_id": "my-org-llama3.1-8b",
      "model_arch_id": "llama3.1-8b",
      "created": 1736600000,
      "updated": 1736650000,
      "org_name": "my-org"
    }
  ]
}

This feature is in Private Preview. For access or more information, contact us or reach out to your account representative.

List all endpoints accessible to an organization for management.

Authorizations

Authorization

string

header

required

Management API key generated from the Management API keys section on the API keys page at https://cloud.cerebras.ai. Use the format: Bearer <MANAGEMENT_API_KEY>

Path Parameters

org_name

string

required

Cerebras customer management organization name found under Management API keys on the API keys page at https://cloud.cerebras.ai.

Note: This is not to be confused with org_id.

Response

200 - application/json

Successful Response

endpoints

EndpointSummary · object[]

required

Endpoints that can be managed via the API.

Hide child attributes

endpoints.endpoint_id

string

required

Unique identifier for the endpoint. It is used as the model field when making an inference request.

Example: my-org-gpt-oss-120b

curl --location 'https://api.cerebras.ai/v1/chat/completions' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer ${CEREBRAS_API_KEY}" \
--data '{
  "model": "my-org-gpt-oss-120b",
  "messages": [{"content": "Hello!", "role": "user"}]
}'

endpoints.model_arch_id

string

required

Name of the model architecture (e.g. llama3.1-8b, gpt-oss-120b).

endpoints.created

integer | null

Unix timestamp (in seconds) when the endpoint was created.

endpoints.updated

integer | null

Unix timestamp (in seconds) when the endpoint was last updated.

endpoints.org_name

string | null

Organization currently managing the endpoint.

Update model version aliases

Deploy model to endpoint

⌘I

Introduction

Chat

Completions

Models

Batch

Files

Metrics

Management

List endpoints

Authorizations

Path Parameters

Response