title: OVHcloud AI Endpoints

OVHcloud AI Endpoints is a serverless inference API that provides access to a curated selection of models (e.g., Llama, Mistral, Qwen, Deepseek). It is designed with security and data privacy in mind and is compliant with GDPR.
To get started, create an API key on the OVHcloud AI Endpoints website. For more information, including pricing, visit the OVHcloud AI Endpoints product page.

Chat model

We recommend configuring Qwen2.5-Coder-32B-Instruct as your chat model. Check our catalog to see all of our models hsoted on AI Endpoints.
config.yaml
models:
  - name: Qwen2.5-Coder-32B-Instruct
    provider: ovhcloud
    model: qwen2.5-coder-32b
    apiKey: <YOUR_AIENDPOINTS_API_KEY>

Embeddings model

We recommend configuring bge-multilingual-gemma2 as your embeddings model.
config.yaml
models:
  - name: BGE Multilingual Gemma2
    provider: ovhcloud
    model: bge-multilingual-gemma2
    apiKey: <YOUR_AIENDPOINTS_API_KEY>
    roles: 
      - embed