OVHcloud Engineering

Follow our adventurous engineering team through the various Tech universe they discover and share on through and for the technical communities

reference architecture vLLM deployment and metrics obervability stack

Reference Architecture: Custom metric autoscaling for LLM inference with vLLM on OVHcloud AI Deploy and observability using MKS

Take your LLM (Large Language Model) deployment to production level with comprehensive custom autoscaling configuration and advanced vLLM metrics observability. This reference architecture describes a comprehensive solution for deploying, autoscaling and monitoring vLLM-based LLM workloads on OVHcloud infrastructure. It combinesAI Deploy, used for model serving with custom metric autoscaling, and Managed Kubernetes Service (MKS), which […]

Reference Architecture: Custom metric autoscaling for LLM inference with vLLM on OVHcloud AI Deploy and observability using MKS Read More »

n8n rag AI agent architecture schemas

Reference Architecture: build a sovereign n8n RAG workflow for AI agent using OVHcloud Public Cloud solutions

What if an n8n workflow, deployed in a sovereign environment, saved you time while giving you peace of mind? From document ingestion to targeted response generation, n8n acts as the conductor of your RAG pipeline without compromising data protection. In the current landscape of AI agents and knowledge assistants, connecting your internal documentation with Large Language Models (LLMs)

Reference Architecture: build a sovereign n8n RAG workflow for AI agent using OVHcloud Public Cloud solutions Read More »

An image with a lock

Safety first: Detect harmful texts using an AI safeguard agent

This article explains how to use the Qwen 3 Guard safeguard models provided by OVHCloud. Using this guide, you can analyse and moderate texts for LLM applications, chat platforms, customer support systems, or any other text-based services requiring safe and compliant interactions. Our focus will be on written content, such as conversations or plain text.

Safety first: Detect harmful texts using an AI safeguard agent Read More »

Moving Beyond Ingress: Why should OVHcloud Managed Kubernetes Service (MKS) users start looking at the Gateway API?

For years, the Kubernetes Ingress API, and the popular Ingress NGINX controller (ingress-nginx), have been the default way to expose applications running inside a Kubernetes cluster. But the ecosystem is changing: the Kubernetes SIG network has announced the retirement of Ingress NGINX in March 2026. After March 2026 the Ingress NGINX will no longer get

Moving Beyond Ingress: Why should OVHcloud Managed Kubernetes Service (MKS) users start looking at the Gateway API? Read More »

Industrial Excellence meets Artificial Intelligence: Behind the Scenes with Smart Datacenter

At OVHcloud, we are constantly looking for ways to improve our operations and reduce our impact on the environment. This has been a defining part of the company since 1999 and is a key part of our organisational DNA and our commercial model. We are very proud to present the new Smart Datacenter cooling system,

Industrial Excellence meets Artificial Intelligence: Behind the Scenes with Smart Datacenter Read More »

Manage your secrets using OVHcloud Secret Manager with External Secrets Operator (ESO) on OVHcloud Managed Kubernetes Service (MKS)

Secrets resources in Kubernetes help us keep sensitive information like logins, passwords, tokens, credentials and certificates secure. But just a heads up: Secrets in Kubernetes are base64 encoded, not encrypted so anyone can read and decode them if they know how. The good news is that OVHcloud has just launched the Secret Manager Beta, which

Manage your secrets using OVHcloud Secret Manager with External Secrets Operator (ESO) on OVHcloud Managed Kubernetes Service (MKS) Read More »

OVHcloud backbone network: Environmental impact assessment methodology

Introduction The underlying infrastructure of OVHcloud’s Cloud services consists of datacentres connected by a global telecommunication network which carries data to and from end users. The core network (backbone) features nodes (also known as Points of Presence – PoPs) and long-distance/metropolitan spans (also known as links) which connect the nodes. The PoPs are located in

OVHcloud backbone network: Environmental impact assessment methodology Read More »

A robot listening a podcast

Create a podcast transcript with Whisper by AI Endpoints

Check out this blog post if you want to know more about AI Endpoints.You can also find more info on AI Endpoints in our previous blog posts. This blog post explains how to create a podcast transcript using Whisper, a powerful automatic speech recognition (ASR) system developed by OpenAI. Whisper integrates with AI Endpoints and

Create a podcast transcript with Whisper by AI Endpoints Read More »

Create encrypted Persistent Volumes on OVHcloud Managed Kubernetes clusters with LUKS

Since this summer, it’s possible to create encrypted OVHcloud Block Storage with OMK (OVHcloud managed key) in RBX, SBG, Paris & BHS regions. More regions will come in the coming months 💪. And the good news is that you can use encrypted Block Storage using Persistent Volumes in your OVHcloud Managed Kubernetes Service (MKS) clusters

Create encrypted Persistent Volumes on OVHcloud Managed Kubernetes clusters with LUKS Read More »

A robot with a car tuning style

Fine tune an LLM with Axolotl and OVHcloud Machine Learning Services

There are many ways to train a model,📚 using detailed instructions, system prompts, Retrieval Augmented Generation, or function calling One way is fine-tuning, which is what this blog is about! ✨ Two years back we posted a blog on fine-tuning Llama models—it’s not nearly as complicated as it was before 😉.  This time we’re using the

Fine tune an LLM with Axolotl and OVHcloud Machine Learning Services Read More »