Multivariate Anomaly Detection Using Frequent Itemset Mining

Arman Behkish

Multivariate Anomaly Detection Using Frequent Itemset Mining.

Supervisor: Luca Cagliero. Politecnico di Torino, Master's degree programme in Data Science And Engineering, 2025

PDF (Tesi_di_laurea) - Thesis
Access restricted to: staff users only until 11 April 2026 (embargo date).
License: Creative Commons Attribution Non-commercial No Derivatives.

Download (6MB)
Abstract:

The widespread use of IoT devices, which are predicted to generate 79 zettabytes of data annually by 2025, is only one example of why time series data mining, and anomaly detection in particular, matters in both academia and industry. Despite extensive research that has produced hundreds of algorithms, current frameworks do not adequately abstract technical complexity away for domain experts. In this dissertation, we introduce a novel framework that efficiently aggregates and summarizes anomaly scores from very high-dimensional datasets comprising millions of data points, enabling flexible query support and precise responses. Our approach employs a windowing technique to transform multidimensional anomaly scores into a transaction database, thereby leveraging established itemset mining algorithms. We use Matrix Profile, a well-recognized method for detecting discords, although in theory the approach can work with any anomaly scores. Experimental results on real-world as well as large-scale synthetic datasets demonstrate that our method outperforms existing comparable multidimensional approaches in both speed and query capability. To the best of our knowledge, this is the first work to integrate multidimensional anomaly detection with itemset mining. We illustrate its potential through several application scenarios and provide a modular design that invites further extension by the research community.
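
The windowing step described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the function names, the non-overlapping windows, the rule "a dimension joins a transaction when its maximum score in the window exceeds a threshold", and the brute-force miner are all assumptions made here for illustration. The anomaly scores are taken as given (for example from Matrix Profile), and computing them is not shown.

```python
from itertools import combinations


def scores_to_transactions(scores, window, threshold):
    """Turn per-dimension anomaly scores into one transaction per window.

    scores: dict mapping a dimension name to a list of scores.
    Assumed rule (not taken from the thesis): a dimension joins a window's
    transaction when its maximum score inside that window exceeds threshold.
    """
    length = min(len(values) for values in scores.values())
    transactions = []
    for start in range(0, length, window):
        items = frozenset(
            name
            for name, values in scores.items()
            if max(values[start:start + window]) > threshold
        )
        if items:  # skip windows in which no dimension looks anomalous
            transactions.append(items)
    return transactions


def frequent_itemsets(transactions, min_support):
    """Brute-force miner: count every non-empty subset of every transaction.

    Only suitable for small examples; an established algorithm such as
    Apriori or FP-Growth would replace this on real data.
    """
    counts = {}
    for items in transactions:
        for size in range(1, len(items) + 1):
            for subset in combinations(sorted(items), size):
                counts[subset] = counts.get(subset, 0) + 1
    return {itemset: n for itemset, n in counts.items() if n >= min_support}


# Hypothetical scores for three sensor dimensions over six time steps.
scores = {
    "temp":     [0.1, 0.9, 0.2, 0.1, 0.8, 0.1],
    "pressure": [0.2, 0.7, 0.1, 0.1, 0.9, 0.2],
    "flow":     [0.1, 0.1, 0.1, 0.6, 0.1, 0.1],
}
transactions = scores_to_transactions(scores, window=2, threshold=0.5)
frequent = frequent_itemsets(transactions, min_support=2)
print(frequent)
```

In this toy input, "temp" and "pressure" are anomalous together in two of the three windows, so the pair appears as a frequent itemset, while "flow" is anomalous only once and is filtered out. A query such as "which dimensions tend to be anomalous together?" then reduces to a lookup over the mined itemsets.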

Supervisors: Luca Cagliero
Academic year: 2024/25
Publication type: Electronic
Number of pages: 65
Subjects:
Degree programme: Master's degree programme in Data Science And Engineering
Degree class: New regulations > Master's degree > LM-32 - COMPUTER ENGINEERING
Collaborating companies: Politecnico di Torino
URI: http://webthesis.biblio.polito.it/id/eprint/35359