curl -sS 'https://api.cerebras.ai/public/v1/models' | jq
{
"object": "list",
"data": [
{
"id": "gpt-oss-120b",
"object": "model",
"created": 1754438400,
"owned_by": "OpenAI",
"name": "OpenAI GPT OSS",
"description": "This model excels at efficient reasoning across science, math, and coding applications. It's ideal for real-time coding assistance, processing large documents for Q&A and summarization, agentic research workflows, and regulated on-premises workloads.",
"hugging_face_id": "openai/gpt-oss-120b",
"pricing": {
"prompt": "0.00000035",
"completion": "0.00000075"
},
"capabilities": {
"streaming": true,
"function_calling": true,
"structured_outputs": true,
"vision": false,
"json_mode": true,
"tools": true,
"tool_choice": true,
"parallel_tool_calls": false,
"response_format": true,
"reasoning": true
},
"supported_parameters": {
"temperature": true,
"top_p": true,
"seed": true,
"stop": true,
"max_completion_tokens": true,
"logprobs": true,
"top_logprobs": true,
"frequency_penalty": true,
"presence_penalty": true,
"logit_bias": true,
"repetition_penalty": false
},
"architecture": {
"modality": "text",
"tokenizer": "GPT",
"instruct_type": "harmony"
},
"limits": {
"max_context_length": 131072,
"max_completion_tokens": 40960,
"requests_per_minute": null,
"tokens_per_minute": null
},
"datacenter_locations": [],
"deprecated": false,
"preview": false,
"quantization": "FP16/8 (weights only)"
}
]
}
curl -sS 'https://api.cerebras.ai/public/v1/models' | jq
{
"object": "list",
"data": [
{
"id": "gpt-oss-120b",
"object": "model",
"created": 1754438400,
"owned_by": "OpenAI",
"name": "OpenAI GPT OSS",
"description": "This model excels at efficient reasoning across science, math, and coding applications. It's ideal for real-time coding assistance, processing large documents for Q&A and summarization, agentic research workflows, and regulated on-premises workloads.",
"hugging_face_id": "openai/gpt-oss-120b",
"pricing": {
"prompt": "0.00000035",
"completion": "0.00000075"
},
"capabilities": {
"streaming": true,
"function_calling": true,
"structured_outputs": true,
"vision": false,
"json_mode": true,
"tools": true,
"tool_choice": true,
"parallel_tool_calls": false,
"response_format": true,
"reasoning": true
},
"supported_parameters": {
"temperature": true,
"top_p": true,
"seed": true,
"stop": true,
"max_completion_tokens": true,
"logprobs": true,
"top_logprobs": true,
"frequency_penalty": true,
"presence_penalty": true,
"logit_bias": true,
"repetition_penalty": false
},
"architecture": {
"modality": "text",
"tokenizer": "GPT",
"instruct_type": "harmony"
},
"limits": {
"max_context_length": 131072,
"max_completion_tokens": 40960,
"requests_per_minute": null,
"tokens_per_minute": null
},
"datacenter_locations": [],
"deprecated": false,
"preview": false,
"quantization": "FP16/8 (weights only)"
}
]
}
Retrieve details about publicly available Cerebras models without an API key, including context length, pricing, and supported features. This endpoint supports multiple response formats for compatibility with different platforms.Documentation Index
Fetch the complete documentation index at: https://inference-docs.cerebras.ai/llms.txt
Use this file to discover all available pages before exploring further.
format query parameter:
openrouter, huggingface. Omit for default Cerebras format.list for list responses or model for single model responses.Show model properties
gpt-oss-120b).model.Show capability fields
Show extended properties
["text"]).["text"]).Show pricing fields
"0")."0" for text-only models)."0")."0").["tools", "json_mode", "structured_outputs"]).Show properties
model.curl -sS 'https://api.cerebras.ai/public/v1/models' | jq
{
"object": "list",
"data": [
{
"id": "gpt-oss-120b",
"object": "model",
"created": 1754438400,
"owned_by": "OpenAI",
"name": "OpenAI GPT OSS",
"description": "This model excels at efficient reasoning across science, math, and coding applications. It's ideal for real-time coding assistance, processing large documents for Q&A and summarization, agentic research workflows, and regulated on-premises workloads.",
"hugging_face_id": "openai/gpt-oss-120b",
"pricing": {
"prompt": "0.00000035",
"completion": "0.00000075"
},
"capabilities": {
"streaming": true,
"function_calling": true,
"structured_outputs": true,
"vision": false,
"json_mode": true,
"tools": true,
"tool_choice": true,
"parallel_tool_calls": false,
"response_format": true,
"reasoning": true
},
"supported_parameters": {
"temperature": true,
"top_p": true,
"seed": true,
"stop": true,
"max_completion_tokens": true,
"logprobs": true,
"top_logprobs": true,
"frequency_penalty": true,
"presence_penalty": true,
"logit_bias": true,
"repetition_penalty": false
},
"architecture": {
"modality": "text",
"tokenizer": "GPT",
"instruct_type": "harmony"
},
"limits": {
"max_context_length": 131072,
"max_completion_tokens": 40960,
"requests_per_minute": null,
"tokens_per_minute": null
},
"datacenter_locations": [],
"deprecated": false,
"preview": false,
"quantization": "FP16/8 (weights only)"
}
]
}
GET https://api.cerebras.ai/public/v1/models/{model_id}
gpt-oss-120b, zai-glm-4.7).curl -sS 'https://api.cerebras.ai/public/v1/models/gpt-oss-120b' | jq
{
"id": "gpt-oss-120b",
"object": "model",
"created": 1754438400,
"owned_by": "OpenAI",
"name": "OpenAI GPT OSS",
"description": "This model excels at efficient reasoning across science, math, and coding applications. It's ideal for real-time coding assistance, processing large documents for Q&A and summarization, agentic research workflows, and regulated on-premises workloads.",
"hugging_face_id": "openai/gpt-oss-120b",
"pricing": {
"prompt": "0.00000035",
"completion": "0.00000075"
},
"capabilities": {
"streaming": true,
"function_calling": true,
"structured_outputs": true,
"vision": false,
"json_mode": true,
"tools": true,
"tool_choice": true,
"parallel_tool_calls": false,
"response_format": true,
"reasoning": true
},
"limits": {
"max_context_length": 131072,
"max_completion_tokens": 40960,
"requests_per_minute": null,
"tokens_per_minute": null
},
"deprecated": false,
"preview": false,
"quantization": "FP16/8 (weights only)"
}
Was this page helpful?