polito.it
Politecnico di Torino (logo)

Real Time Anomaly Detection on Multivariate Time-Series

Erika Bosco

Real Time Anomaly Detection on Multivariate Time-Series.

Rel. Daniele Apiletti. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview
Abstract:

The anomaly detection is a specific branch on the data analysis that focuses on finding and highlighting the outliers of the dataset. It is useful across a range of real-world settings, including cyber-security, manufacturing, fraud detection, medical imaging, and other. In particular, in a multivariate time-series the anomalies are both depending on the state of a single time-series and the reciprocate distance between features. This thesis aims to find the anomalies in a dataset derived from the sysmetric, an Oracle system view, that collects database health and performances every minute. To achieve this goal we collected, cleaned and preprocessed the data in order to test different unsupervised machine learning models -abod, feature bagging, pca, gaussian mixture model, cblof, isolation forest- and compared the results with a validation dataset previously cleaned and preprocessed. The results obtained are optimal for the purpose of Mediamente Consulting srl as they can help to change the approach to the database monitoring, now reactive, to become proactive.

Relators: Daniele Apiletti
Academic year: 2022/23
Publication type: Electronic
Number of Pages: 56
Subjects:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Aziende collaboratrici: Mediamente Consulting srl
URI: http://webthesis.biblio.polito.it/id/eprint/27686
Modify record (reserved for operators) Modify record (reserved for operators)