Public models

Retrieve details about publicly available Cerebras models, including context length, pricing, and supported features. The endpoint supports multiple response formats for compatibility with different platforms.

This endpoint is public and does not require an API key.

Supported Formats

The endpoint supports three response formats via the format query parameter:

Default (Cerebras) - Native Cerebras format
OpenRouter - OpenRouter-compatible format
HuggingFace - HuggingFace-compatible format

Query Parameters

format

string

Response format. Options: openrouter, huggingface. Omit for default Cerebras format.
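As a minimal sketch of how the format parameter might be used from Python, the helper below builds the request URL and fetches the list with the standard library only. The function names (models_url, fetch_models), the KNOWN_FORMATS set, and the timeout are illustrative assumptions, not part of the Cerebras API.

```python
import json
from typing import Optional
from urllib.parse import urlencode
from urllib.request import urlopen

# List endpoint as documented on this page.
BASE_URL = "https://api.cerebras.ai/public/v1/models"
# Values accepted by the `format` query parameter, per this page.
KNOWN_FORMATS = {"openrouter", "huggingface"}


def models_url(fmt: Optional[str] = None) -> str:
    """Return the list URL; omit `format` to get the default Cerebras format."""
    if fmt is None:
        return BASE_URL
    if fmt not in KNOWN_FORMATS:
        raise ValueError(f"unsupported format: {fmt!r}")
    return f"{BASE_URL}?{urlencode({'format': fmt})}"


def fetch_models(fmt: Optional[str] = None, timeout: float = 10.0) -> dict:
    """Fetch and decode the model list. No Authorization header is sent."""
    with urlopen(models_url(fmt), timeout=timeout) as resp:
        return json.load(resp)
```

Calling fetch_models("openrouter") would request the OpenRouter-compatible payload; fetch_models() with no argument requests the default format.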

Response

Default Format

object

string

The object type: list for list responses, model for single-model responses.

data

array

Array of model objects (only in list responses).

id

string

The model identifier (e.g., llama3.1-8b).

object

string

The object type, which is always model.

created

integer

The Unix timestamp (in seconds) of when the model was created.

owned_by

string

The organization that owns the model.

OpenRouter Format

data

array

Array of model objects with extended metadata.

id

string

The model identifier.

name

string

Human-readable model name.

input_modalities

array

Supported input types (e.g., ["text"]).

output_modalities

array

Supported output types (e.g., ["text"]).

context_length

integer

Maximum context window size in tokens.

max_output_length

integer

Maximum output length in tokens.

pricing

object

Pricing information per token.

prompt

string

Cost per prompt token.

completion

string

Cost per completion token.

supported_sampling_parameters

array

List of supported sampling parameters.

supported_features

array

List of supported features (e.g., ["streaming", "json_mode", "tools"]).
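Because the OpenRouter format reports prices as per-token strings, a consumer will often want to convert them to a more readable unit. The sketch below does this with Decimal so that small values are not distorted by floating-point rounding. The sample entry and its numbers are hypothetical, as is the helper name price_per_million.

```python
from decimal import Decimal

# Hypothetical OpenRouter-format entry; the values are made up for illustration.
sample_model = {
    "id": "example-model",
    "name": "Example Model",
    "input_modalities": ["text"],
    "output_modalities": ["text"],
    "context_length": 8192,
    "max_output_length": 2048,
    "pricing": {"prompt": "0.0000001", "completion": "0.0000002"},
    "supported_sampling_parameters": ["temperature", "top_p"],
    "supported_features": ["streaming", "tools"],
}


def price_per_million(model: dict) -> dict:
    """Convert per-token price strings into Decimal prices per million tokens."""
    pricing = model["pricing"]
    return {
        kind: Decimal(pricing[kind]) * 1_000_000
        for kind in ("prompt", "completion")
    }


prices = price_per_million(sample_model)
print(f"{sample_model['id']}: prompt={prices['prompt']:f}, completion={prices['completion']:f}")
```

Parsing the strings with Decimal rather than float keeps the conversion exact, which matters when prices are later summed or compared.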

HuggingFace Format

data

array

Array of model objects.

id

string

The model identifier.

object

string

The object type, which is always model.

created

integer

The Unix timestamp (in seconds) of when the model was created.

owned_by

string

The organization that owns the model.

pricing

object

Pricing in USD per million tokens.

input

number

Price per million input tokens.

output

number

Price per million output tokens.

context_length

integer

Maximum context window size in tokens.
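Since the HuggingFace format expresses pricing in USD per million tokens, estimating the cost of a request is a short calculation. The following sketch shows one way to do it and to check a request against context_length. The sample entry, its prices, and the helper names are assumptions made for illustration.

```python
# Hypothetical HuggingFace-format entry; the prices are made up for illustration.
sample_model = {
    "id": "example-model",
    "object": "model",
    "created": 1234567890,
    "owned_by": "cerebras",
    "pricing": {"input": 0.5, "output": 2.0},
    "context_length": 8192,
}


def estimate_cost_usd(model: dict, input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD from per-million-token prices."""
    pricing = model["pricing"]
    total = input_tokens * pricing["input"] + output_tokens * pricing["output"]
    return total / 1_000_000


def fits_context(model: dict, input_tokens: int, output_tokens: int) -> bool:
    """Check whether prompt plus completion fits in the model's context window."""
    return input_tokens + output_tokens <= model["context_length"]


cost = estimate_cost_usd(sample_model, input_tokens=1_000_000, output_tokens=250_000)
print(f"estimated cost: ${cost:.2f}")
```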

Example request and response (default format):

curl -sS 'https://api.cerebras.ai/public/v1/models' | jq

{
  "object": "list",
  "data": [
    {
      "id": "llama3.1-8b",
      "object": "model",
      "created": 1234567890,
      "owned_by": "cerebras"
    },
    {
      "id": "gpt-oss-120b",
      "object": "model",
      "created": 1234567890,
      "owned_by": "cerebras"
    }
  ]
}
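A default-format list response like the one above can be processed with a few lines of Python. This sketch parses the sample payload, checks the object field, and converts created from a Unix timestamp to a UTC datetime. The helper name summarize is an assumption for illustration.

```python
import json
from datetime import datetime, timezone

# The sample list response shown on this page.
SAMPLE = """
{
  "object": "list",
  "data": [
    {"id": "llama3.1-8b", "object": "model", "created": 1234567890, "owned_by": "cerebras"},
    {"id": "gpt-oss-120b", "object": "model", "created": 1234567890, "owned_by": "cerebras"}
  ]
}
"""


def summarize(payload: dict) -> list:
    """Return (id, owned_by, created-as-UTC-datetime) for each model in a list response."""
    if payload.get("object") != "list":
        raise ValueError("expected a list response")
    return [
        (m["id"], m["owned_by"], datetime.fromtimestamp(m["created"], tz=timezone.utc))
        for m in payload["data"]
    ]


for model_id, owner, created in summarize(json.loads(SAMPLE)):
    print(model_id, owner, created.isoformat())
```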

Retrieve Specific Model

You can also retrieve information about a specific model by appending the model ID to the endpoint:

GET https://api.cerebras.ai/public/v1/models/{model_id}

Path Parameters

model_id

string

required

The ID of the model to retrieve (e.g., llama3.1-8b, gpt-oss-120b).

curl -sS 'https://api.cerebras.ai/public/v1/models/llama3.1-8b' | jq

{
  "id": "llama3.1-8b",
  "object": "model",
  "created": 1234567890,
  "owned_by": "cerebras"
}
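The single-model request can be sketched in Python as follows. The helper percent-encodes the model ID before appending it to the path, and a second helper checks that the response is a single model object. Both function names are illustrative assumptions, and the page does not describe what the endpoint returns for an unknown ID, so no error handling is shown.

```python
from urllib.parse import quote

# List endpoint as documented on this page; the model ID is appended as a path segment.
BASE_URL = "https://api.cerebras.ai/public/v1/models"


def model_url(model_id: str) -> str:
    """Build the URL for a single model, percent-encoding the ID."""
    if not model_id:
        raise ValueError("model_id is required")
    return f"{BASE_URL}/{quote(model_id, safe='')}"


def parse_model(payload: dict) -> dict:
    """Validate a single-model response and return its core fields."""
    if payload.get("object") != "model":
        raise ValueError("expected a single model object")
    return {key: payload[key] for key in ("id", "created", "owned_by")}


print(model_url("llama3.1-8b"))
```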

Use Cases

Platform Integration - Use the OpenRouter or HuggingFace formats to integrate Cerebras models into existing platforms that support these standards.
Model Discovery - Programmatically discover available models and their capabilities without authentication.
Pricing Comparison - Compare pricing across different models using the structured pricing information.
Feature Detection - Check which features (streaming, tools, JSON mode) are supported by each model.
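Feature detection, for instance, amounts to filtering the OpenRouter-format data array on supported_features. The sketch below assumes that field is a list of strings as described on this page. The sample entries and the helper name models_supporting are hypothetical.

```python
# Hypothetical OpenRouter-format entries, trimmed to the fields used here.
sample_models = [
    {"id": "model-a", "supported_features": ["streaming", "json_mode", "tools"]},
    {"id": "model-b", "supported_features": ["streaming"]},
    {"id": "model-c"},  # treated as supporting no features if the field is absent
]


def models_supporting(models: list, *features: str) -> list:
    """Return the IDs of models whose supported_features include all requested features."""
    wanted = set(features)
    return [
        m["id"]
        for m in models
        if wanted <= set(m.get("supported_features", []))
    ]


print(models_supporting(sample_models, "streaming", "tools"))
```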
