GPU for LLM Inferencing Guide
A guide to choosing a GPU, and the setup in which to run it, for LLM inference.
This document presents a reference architecture for a simple Retrieval Augmented Generation (RAG) solution based on a vector database, using OVHcloud managed services. In this use case, a large number of PDF/Markdown documents are ingested as a single batch to create a knowledge base, and a simple text chat interface is exposed to the user.
Reference Architecture: Retrieval Augmented Generation (RAG)
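The batch-ingestion and retrieval flow described above can be sketched as follows. This is a toy illustration, not the architecture's actual implementation: the hashed bag-of-words `embed` function and the brute-force in-memory `VectorStore` are stand-ins for a real embedding model and a managed vector database.

```python
import math
from collections import Counter

DIM = 256  # toy embedding dimension; real models use far larger vectors

def embed(text: str) -> list[float]:
    """Hashed bag-of-words embedding: a placeholder for a real embedding model."""
    vec = [0.0] * DIM
    for token, count in Counter(text.lower().split()).items():
        vec[hash(token) % DIM] += count
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]

class VectorStore:
    """Brute-force in-memory store standing in for a managed vector database."""
    def __init__(self) -> None:
        self.docs: list[tuple[list[float], str]] = []

    def ingest_batch(self, documents: list[str]) -> None:
        # Single-batch ingestion of the knowledge base, as in the use case above.
        self.docs.extend((embed(d), d) for d in documents)

    def query(self, question: str, k: int = 1) -> list[str]:
        # Cosine similarity search; retrieved text becomes the LLM's context.
        q = embed(question)
        scored = sorted(self.docs,
                        key=lambda p: -sum(a * b for a, b in zip(q, p[0])))
        return [text for _, text in scored[:k]]

store = VectorStore()
store.ingest_batch([
    "GPUs accelerate LLM inference by parallelising matrix multiplications.",
    "A vector database stores document embeddings for similarity search.",
])
context = store.query("What does a vector database store?")
print(context[0])
```

In the full architecture, the retrieved passages would be prepended to the user's question before it is sent to the LLM, grounding the answer in the knowledge base.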