OVHcloud Engineering

Follow our adventurous engineering team through the various tech universes they discover and share with, and for, the technical communities.

Deep Dive into DeepSeek-R1 – Part 1

Introduction: A few weeks ago, the release of the open-source large language model DeepSeek-R1 took the AI world by storm. The Chinese research team claimed their new reasoning model was on par with OpenAI’s flagship model o1, open-sourced the model, and gave details about the work behind it. In this blog post series, we […]

Rocket in MistralAI colors in a data center with a French rooster showing rapid LLM deployment

Mistral Small 24B served with vLLM and AI Deploy – a single command to deploy an LLM (Part 1)

You are not dreaming! You can deploy an open-source LLM with a single command line. Deploying advanced language models can be a challenge! But this sometimes arduous task is becoming increasingly accessible, enabling developers to integrate sophisticated AI capabilities into their applications. In this guide, we will walk through deploying the Mistral-Small-24B-Instruct-2501 model using vLLM

Falco plugin for OVHcloud MKS Audit Logs

Enhancing Kubernetes Security: Detecting Threats in OVHcloud Managed Kubernetes cluster (MKS) Audit Logs with Falco

Several months ago, we discovered Falco, a cloud-native, near-real-time threat detection tool, and we saw how to install it on an OVHcloud MKS cluster. Today we will connect our Falco instance to an MKS cluster in order to retrieve Kubernetes Audit Logs events and check that everything is OK in our cluster. Concretely,

A woman coding helped by a parrot and a whale

Release of DeepSeek-R1 on OVHcloud AI Endpoints

🚀 We are thrilled to announce the release of DeepSeek-R1-Distill-Llama-70B on AI Endpoints! Distilled from DeepSeek-R1, this powerful model excels in math, coding, and reasoning tasks. With AI Endpoints, you can integrate this model into your applications without needing extensive AI expertise. Our platform is designed with simplicity, security, and data privacy in mind, ensuring

Infrastructure as Code (IaC) on OVHcloud – part 2: Pulumi

In our previous article, Infrastructure as Code (IaC) on OVHcloud – part 1: Terraform / OpenTofu, we saw that deploying manually takes time and is complicated. That is why Infrastructure as Code (IaC) is powerful: it allows you to automate infrastructure provisioning. But did you know that Terraform/OpenTofu is not the only tool that exists? In

🧠AI Concepts in a Nutshell: LLM Optimization

RAG vs. Fine-Tuning: Choosing the Right Method for External Knowledge. In AI development, incorporating proprietary data and external knowledge is crucial. Two key methodologies are Retrieval Augmented Generation (RAG) and fine-tuning. Here’s a quick comparison. Retrieval Augmented Generation (RAG) 🔍 RAG combines an LLM’s reasoning with external knowledge through three steps: 1️⃣ Retrieve: Identify related documents

Create Kubernetes clusters with OVHcloud Managed Rancher Service

Container orchestration is now essential for modern application deployment, providing scalability, flexibility, and resource efficiency. It has become common to have to manage several Kubernetes clusters, but doing so effectively requires the right tools. Fortunately, OVHcloud offers a solution that enables you to manage all your Kubernetes clusters from a single, centralized management tool: Managed

🧠 AI concept in a Nutshell: LLM series.

LLMs (Large Language Models) have undoubtedly been one of the most talked-about topics over the past two years, since the release of ChatGPT by OpenAI. The Basics of LLMs: Large Language Models are essentially sophisticated AI systems designed to understand and generate human-like text. What makes them “large” is the sheer volume of data they’re

A happy AI

Create a sentiments analyzer with AI Endpoints

If you want more information on AI Endpoints, please read the following blog post. You can also have a look at our previous blog posts on how to use AI Endpoints. OVHcloud AI Endpoints allows developers to easily add AI features to their day-to-day developments. In this blog post, we will explore how

Reference Architecture : Retrieval Augmented Generation (RAG)

This document presents a reference architecture for a simple Retrieval Augmented Generation solution based on a vector database, using OVHcloud managed services. In this use case, we have a large number of PDF/Markdown documents that are ingested as a single batch to create a knowledge base and a simple text chat interface for a user
