
Self-healing Mechanisms: Analysis and Implementation for Stringent-SLAs Products

Emanuele Falci

Self-healing Mechanisms: Analysis and Implementation for Stringent-SLAs Products.

Supervisor: Maurizio Rebaudengo. Politecnico di Torino, Master of Science program in Computer Engineering, 2025

PDF (Tesi_di_laurea) - Thesis
License: Creative Commons Attribution Non-commercial No Derivatives.

Abstract:

This thesis explores self-healing mechanisms in cloud applications, with a focus on reducing manual intervention in site reliability engineering (SRE). Conducted as part of an internship at Amadeus, the study examines existing self-healing solutions, evaluates their integration within the company’s technology stack, and proposes improvements to automated incident remediation. The research covers the built-in self-healing features of key infrastructure components such as Couchbase, Oracle DB, Kubernetes, and cloud platforms. It also investigates external monitoring and automation tools, including ARGOS, ServiceNow, and AWX, as a means to improve alert management and remediation processes. A key outcome of this work is the enhancement of Amadeus’ "Auto Remediation of Alerts" framework, improving its ability to autonomously resolve common incidents. The findings contribute to the broader field of SRE by showing how automation can improve system availability, reduce operational costs, and shorten response times to failures.

Supervisors: Maurizio Rebaudengo
Academic year: 2024/25
Publication type: Electronic
Number of Pages: 59
Subjects:
Degree program: Master of Science program in Computer Engineering
Degree class: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Joint supervision institution: INSTITUT EURECOM (FRANCE)
Collaborating companies: AMADEUS SAS
URI: http://webthesis.biblio.polito.it/id/eprint/35419