OVHcloud AI Endpoints
OVHcloud AI Endpoints is a serverless inference API that provides access to a curated selection of models (e.g., Llama, Mistral, Qwen, Deepseek). It is designed with security and data privacy in mind and is compliant with GDPR.
info
To get started, create an API key on the OVHcloud AI Endpoints website. For more information, including pricing, visit the OVHcloud AI Endpoints product page.
Chat model
We recommend configuring Qwen2.5-Coder-32B-Instruct as your chat model. Check our catalog to see all of our models hsoted on AI Endpoints.
- YAML
- JSON
config.yaml
models:
- name: Qwen2.5-Coder-32B-Instruct
provider: ovhcloud
model: qwen2.5-coder-32b
apiKey: <YOUR_AIENDPOINTS_API_KEY>
config.json
{
"models": [
{
"title": "Qwen2.5-Coder-32B-Instruct",
"provider": "ovhcloud",
"model": "qwen2.5-coder-32b",
"apiKey": "<YOUR_AIENDPOINTS_API_KEY>"
}
]
}
Embeddings model
We recommend configuring bge-multilingual-gemma2 as your embeddings model.
- YAML
- JSON
config.yaml
models:
- name: BGE Multilingual Gemma2
provider: ovhcloud
model: bge-multilingual-gemma2
apiKey: <YOUR_AIENDPOINTS_API_KEY>
roles:
- embed
config.json
{
"embeddingsProvider": {
"provider": "ovhcloud",
"model": "bge-multilingual-gemma2",
"apiKey": "<YOUR_AIENDPOINTS_API_KEY>"
}
}