AI

vLLM on OVHcloud MKS for high availability and full observability

Reference Architecture: Deploying a vision-language model with vLLM on OVHcloud MKS for high performance inference and full observability

Ensure complete digital sovereignty of your AI models with end-to-end control through open-source solutions on OVHcloud’s Managed Kubernetes Service. This reference architecture demonstrates how to deploy a Large Language Model (LLM) inference system using vLLM on OVHcloud Managed Kubernetes Service (MKS). The solution leverages NVIDIA L40S GPUs to serve the Qwen3-VL-8B-Instruct multimodal model (vision + text) with OpenAI-compatible API endpoints. This comprehensive […]

Document to use with OCR

Extract Text from Images with OCR using Python and OVHcloud AI Endpoints

For more information on AI Endpoints, please read the following blog post. You can also have a look at our previous blog posts on how to use AI Endpoints. You can find the full code example in the GitHub repository. In this article, we will explore how to perform OCR (Optical Character Recognition) on images using a vision-capable LLM, the OpenAI Python library, and

Pricing evolution of Public Cloud, Bare Metal and VPS at OVHcloud

For customers in the United States, the same article with US pricing is available here: https://us.ovhcloud.com/resources/blog/pricing-evolution-of-public-cloud-bare-metal-and-vps-at-ovhcloud/

Since autumn 2025, the global memory market has been going through a major disruption. Although barely noticeable to end users, these developments are radically changing the cost of computer hardware and, as a direct result, the cost of

Pricing evolution of Public Cloud, Bare Metal and VPS at OVHcloud (French-language edition)

Since autumn 2025, the global memory market has been going through a major disruption. Still barely noticeable to end users, this shift is profoundly transforming the cost of computer hardware and, as a direct result, the cost of the cloud. This article offers an analysis of this structural crisis, its concrete impacts, and the strategic choices that OVHcloud

Reference architecture: vLLM deployment and metrics observability stack

Reference Architecture: Custom metric autoscaling for LLM inference with vLLM on OVHcloud AI Deploy and observability using MKS

Take your LLM (Large Language Model) deployment to production level with custom autoscaling configuration and advanced vLLM metrics observability. This reference architecture describes a comprehensive solution for deploying, autoscaling and monitoring vLLM-based LLM workloads on OVHcloud infrastructure. It combines AI Deploy, used for model serving with custom metric autoscaling, and Managed Kubernetes Service (MKS), which

n8n RAG AI agent architecture diagram

Reference Architecture: build a sovereign n8n RAG workflow for AI agent using OVHcloud Public Cloud solutions

What if an n8n workflow, deployed in a sovereign environment, saved you time while giving you peace of mind? From document ingestion to targeted response generation, n8n acts as the conductor of your RAG pipeline without compromising data protection. In the current landscape of AI agents and knowledge assistants, connecting your internal documentation with Large Language Models (LLMs)

An image with a lock

Safety first: Detect harmful texts using an AI safeguard agent

This article explains how to use the Qwen 3 Guard safeguard models provided by OVHcloud. Using this guide, you can analyse and moderate texts for LLM applications, chat platforms, customer support systems, or any other text-based services requiring safe and compliant interactions. Our focus will be on written content, such as conversations or plain text.

Agentic AI from a security perspective

Large Language Models (LLMs) and generative AI technologies are everywhere, infiltrating both our personal and professional daily lives. Well-known services are already diverting most internet users away from their old browsing habits, and online information consumption is being profoundly transformed, most likely with no possible return to past behaviours. Issues related to intellectual property laws

PostgreSQL and AI: The pragmatic path to smarter data

Beyond the buzz: Building AI on solid foundations

Artificial intelligence has quickly become the cornerstone of digital innovation. From text generation to image recognition and intelligent automation, AI is redefining how organisations extract value from data. At OVHcloud, we believe this transformation shouldn’t only belong to the tech elite – it should be open, accessible,

10 Reasons Scaling Startups Are Migrating to OVHcloud

Cloud infrastructure plays a critical role in how startups scale—affecting everything from product delivery and user experience to budget and compliance. While many startups begin their journey with public cloud giants, the challenges of unpredictable costs, data control, and technical constraints become more apparent as they grow. For startups ready to scale smarter, OVHcloud offers
