Erika Bosco
Real Time Anomaly Detection on Multivariate Time-Series.
Rel. Daniele Apiletti. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2023
|
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (1MB) | Preview |
Abstract: |
The anomaly detection is a specific branch on the data analysis that focuses on finding and highlighting the outliers of the dataset. It is useful across a range of real-world settings, including cyber-security, manufacturing, fraud detection, medical imaging, and other. In particular, in a multivariate time-series the anomalies are both depending on the state of a single time-series and the reciprocate distance between features. This thesis aims to find the anomalies in a dataset derived from the sysmetric, an Oracle system view, that collects database health and performances every minute. To achieve this goal we collected, cleaned and preprocessed the data in order to test different unsupervised machine learning models -abod, feature bagging, pca, gaussian mixture model, cblof, isolation forest- and compared the results with a validation dataset previously cleaned and preprocessed. The results obtained are optimal for the purpose of Mediamente Consulting srl as they can help to change the approach to the database monitoring, now reactive, to become proactive. |
---|---|
Relators: | Daniele Apiletti |
Academic year: | 2022/23 |
Publication type: | Electronic |
Number of Pages: | 56 |
Subjects: | |
Corso di laurea: | Corso di laurea magistrale in Data Science And Engineering |
Classe di laurea: | New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING |
Aziende collaboratrici: | Mediamente Consulting srl |
URI: | http://webthesis.biblio.polito.it/id/eprint/27686 |
Modify record (reserved for operators) |