polito.it
Politecnico di Torino (logo)

Real Time Anomaly Detection on Multivariate Time-Series

Erika Bosco

Real Time Anomaly Detection on Multivariate Time-Series.

Rel. Daniele Apiletti. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview
Abstract:

The anomaly detection is a specific branch on the data analysis that focuses on finding and highlighting the outliers of the dataset. It is useful across a range of real-world settings, including cyber-security, manufacturing, fraud detection, medical imaging, and other. In particular, in a multivariate time-series the anomalies are both depending on the state of a single time-series and the reciprocate distance between features. This thesis aims to find the anomalies in a dataset derived from the sysmetric, an Oracle system view, that collects database health and performances every minute. To achieve this goal we collected, cleaned and preprocessed the data in order to test different unsupervised machine learning models -abod, feature bagging, pca, gaussian mixture model, cblof, isolation forest- and compared the results with a validation dataset previously cleaned and preprocessed. The results obtained are optimal for the purpose of Mediamente Consulting srl as they can help to change the approach to the database monitoring, now reactive, to become proactive.

Relatori: Daniele Apiletti
Anno accademico: 2022/23
Tipo di pubblicazione: Elettronica
Numero di pagine: 56
Soggetti:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: Mediamente Consulting srl
URI: http://webthesis.biblio.polito.it/id/eprint/27686
Modifica (riservato agli operatori) Modifica (riservato agli operatori)