Skip to main content

Context Length

Free Tier64k tokens
Paid Tiers131k tokens

Speed

~3000
tokens/sec

Input / Output

Input Formats JSON, plain text
Output Formatsplain text, structured

Pricing

Input
$0.35 / M tokens
Output
$0.75 / M tokens
Exploration pricing shown above is per million tokens. For volume discounts and enterprise features, see our pricing page.

Model Notes

Model ID: gpt-oss-120b
Logprobs with response format or tool usage is not yet supported for this model.
When min_tokens is set, the model may generate EOS (End of Sequence) tokens which may cause parser failures. Use at your own risk.
This model may call tools that aren't directly specified due to it's training. Monitor for non-approved tools and reprompt with "you're hallucinating a tool call" to help the model self-correct and stick to provided tools.
For this model, our API maps the "system" role to developer-level instructions in our prompt hierarchy. See our OpenAI Compatibility guide for more details.

Rate Limits

TierRequests/minInput Tokens/minOutput Tokens/minDaily Tokens
Free3064k8k/request1M

Endpoints

Chat Completions

Features

Reasoning
Streaming
Structured Outputs
Tool Calling

Need Higher Limits?

Reach out for custom pricing with our Enterprise tier for higher rate limits and dedicated support.Contact Sales
I