DevOps

Warden: the self-healing framework for local actions

Warden: the self-healing framework for local actions

This article is the follow up to Selfheal at Webhosting – The External Part published on 2020-07-17.Part two below covers the local self-healing system. Introduction With over 15-000 servers dedicated to providing services for 6 million websites and web applications of all sorts, across multiple data-centers and geographical zones, a certain amount of software failures …

Warden: the self-healing framework for local actions Read More »

The Bastion - Part 3

The Bastion – Part 3 – Security at the core

In previous parts, we’ve covered the basic principles of the bastion. We then explained how delegation was at the core of the system. This time, we’ll dig into some governing principles of how The Bastion is written. In a nutshell, the main purpose of the bastion is to ensure security, auditability and reliability in all …

The Bastion – Part 3 – Security at the core Read More »

Selfheal at Webhosting – The external part

Selfheal at Webhosting – The external part

Introduction With almost 6000000 websites hosted on more than 15000 servers, the OVHcloud Webhosting SRE team manage lots of alerts during their working day. Our infrastructure is constantly growing, but to scale smoothly, the amount of time spent solving alerts should not increase proportionally. We need, therefore, some tools to help us.  In our team, we …

Selfheal at Webhosting – The external part Read More »