Mehrbod Nowrouz
Multimodal Interpretation of Time-Series Discords: A Hybrid Approach using Matrix Profile and Visual Language Models.
Rel. Luca Cagliero, Ali Yassine. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2026
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (3MB) | Preview |
Abstract
Time-series anomaly detection has traditionally been dominated by statistical methods and narrow Deep Learning architectures. Most well-known algorithms require extensive feature engineering and struggle with complex signals; furthermore, while certain neural models show better performance in pattern recognition, they remain "black boxes" that lack explainability and the ability to incorporate domain context. Often, these systems fall victim to an "illusion of progress" where high numerical scores mask a lack of true semantic understanding of actual system behavior. In contrast, the emergence of Foundational Models introduces a shift towards generalized intelligence, culminating in the development of Vision-Language Models (VLMs). Unlike previous deep learning models, VLMs are natively multimodal, enabling the transformation of time-series data into visual representations.
This thesis investigates whether the reasoning capabilities of VLMs can be leveraged to provide semantic context and pinpoint anomalies—such as frequency drifts or signal distortions—that older models might miss
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
