Inference Providers
Inference Providers is a serverless service powered by external inference providers and routed through Hugging Face and paid per token.You can access your access token from Hugging Face and prioritize your providers in settings.
- YAML
- JSON (Deprecated)
config.yaml
Inference Endpoints
Inference Endpoints is a dedicated service that allows you to run your open models dedicated hardware. It is a more advanced way to get inference from Hugging Face models where you have more control over the whole process.Before you can use Inference Endpoints, you need to create an endpoint. You can do this by going to Inference Endpoints and clicking on “Create Endpoint”.
- YAML
- JSON (Deprecated)
config.yaml