curl --request GET \
  --url https://api.cerebras.ai/management/v1/orgs/{org_name}/endpoints \
  --header 'Authorization: Bearer <token>'

{
  "endpoints": [
    {
      "endpoint_id": "my-org-llama-3.3-70b",
      "model_arch_id": "llama-3.3-70b",
      "created": 1736700000,
      "updated": 1736700000,
      "org_name": "my-org"
    },
    {
      "endpoint_id": "my-org-qwen-3-32b",
      "model_arch_id": "qwen-3-32b",
      "created": 1736600000,
      "updated": 1736650000,
      "org_name": "my-org"
    }
  ]
}

Authorization (header): Management API key, generated in the Management API keys section of the API keys page at https://cloud.cerebras.ai. Use the format: Bearer <MANAGEMENT_API_KEY>
org_name (path): Cerebras customer management organization name, found under Management API keys on the API keys page at https://cloud.cerebras.ai.
Note: org_name is not the same as org_id.
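
As a rough illustration of how the path parameter and the Authorization header fit together, here is a minimal Python sketch that builds the same GET request as the curl command above. The function name, the BASE_URL constant, and the environment variable names in the usage comment are assumptions made for this example, not part of the API. The sketch only constructs the request; sending it is left to the commented-out usage lines.

```python
import urllib.request
from urllib.parse import quote

# Base path taken from the curl example; the constant name is illustrative.
BASE_URL = "https://api.cerebras.ai/management/v1"


def build_list_endpoints_request(org_name: str, management_api_key: str) -> urllib.request.Request:
    """Build (but do not send) the GET request that lists an organization's endpoints."""
    # Percent-encode the organization name so it is safe inside the URL path.
    url = f"{BASE_URL}/orgs/{quote(org_name, safe='')}/endpoints"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {management_api_key}"},
        method="GET",
    )


# Usage (hypothetical environment variable names, requires network access):
#   import os
#   req = build_list_endpoints_request(os.environ["CEREBRAS_ORG_NAME"],
#                                      os.environ["CEREBRAS_MANAGEMENT_API_KEY"])
#   with urllib.request.urlopen(req) as resp:
#       print(resp.read().decode("utf-8"))
```

Keeping request construction separate from the network call makes it easy to check the URL and header before any key is sent over the wire.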
Successful Response
endpoints: Endpoints that can be managed via the API.
endpoint_id: Unique identifier for the endpoint. Use it as the model field when making an inference request.
Example: my-org-llama-3.3-70b
curl --location 'https://api.cerebras.ai/v1/chat/completions' \
--header 'Content-Type: application/json' \
--header "Authorization: Bearer ${CEREBRAS_API_KEY}" \
--data '{
"model": "my-org-llama-3.3-70b",
"messages": [{"content": "Hello!", "role": "user"}]
}'
model_arch_id: Name of the model architecture (e.g. llama3.1-8b, llama-3.3-70b).
created: Unix timestamp (in seconds) when the endpoint was created.
updated: Unix timestamp (in seconds) when the endpoint was last updated.
org_name: Organization that currently manages the endpoint.
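
To make the response fields concrete, the following Python sketch parses the sample response shown on this page, converts the Unix timestamps to UTC ISO 8601 strings, and shows how an endpoint_id would be placed in the model field of a chat completions request body. The helper names summarize_endpoints and chat_request_body are hypothetical and exist only for this example; the JSON is copied from the sample response rather than fetched from the API.

```python
import json
from datetime import datetime, timezone

# Sample response copied from the example on this page.
SAMPLE_RESPONSE = """
{
  "endpoints": [
    {"endpoint_id": "my-org-llama-3.3-70b", "model_arch_id": "llama-3.3-70b",
     "created": 1736700000, "updated": 1736700000, "org_name": "my-org"},
    {"endpoint_id": "my-org-qwen-3-32b", "model_arch_id": "qwen-3-32b",
     "created": 1736600000, "updated": 1736650000, "org_name": "my-org"}
  ]
}
"""


def to_utc_iso(unix_seconds: int) -> str:
    """Convert a Unix timestamp in seconds to an ISO 8601 string in UTC."""
    return datetime.fromtimestamp(unix_seconds, tz=timezone.utc).isoformat()


def summarize_endpoints(payload: dict) -> list:
    """Return one summary dict per endpoint with human-readable timestamps."""
    return [
        {
            "endpoint_id": ep["endpoint_id"],
            "model_arch_id": ep["model_arch_id"],
            "created_utc": to_utc_iso(ep["created"]),
            "updated_utc": to_utc_iso(ep["updated"]),
            "org_name": ep["org_name"],
        }
        for ep in payload.get("endpoints", [])
    ]


def chat_request_body(endpoint_id: str, prompt: str) -> dict:
    """Build a chat completions body that uses the endpoint_id as the model."""
    return {"model": endpoint_id, "messages": [{"role": "user", "content": prompt}]}


rows = summarize_endpoints(json.loads(SAMPLE_RESPONSE))
for row in rows:
    print(row["endpoint_id"], row["model_arch_id"], row["created_utc"], row["updated_utc"])

body = chat_request_body(rows[0]["endpoint_id"], "Hello!")
print(json.dumps(body))
```

An endpoint whose created and updated values are equal, like the first one in the sample, has the same timestamp for creation and last update.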