Learn how to use Cerebras Inference on Hugging Face.
Install the Hugging Face Hub client
Create a new Hugging Face API key
Make an API call
"hf_your_api_key_here"
with your actual API key.What context length can I run?
What additional latency can I expect when using Cerebras through Hugging Face?
Why do I see “Wrong API Format“ when running the Hugging Face test code?