Gaetano Riccardo Ricotta
Propagation pattern mining algorithm: a parallel approach.
Rel. Paolo Garza, Luca Colomba. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (5MB) | Preview |
Abstract
Given the widespread adoption of connected devices and increasingly precise sensors, the volume of spatiotemporal data has grown significantly in recent years. This growth necessitates the development of tools capable of analyzing large amounts of spatial and temporal data to extract relevant information and train machine learning models. However, this data can be heterogeneous and come from various sources, such as weather measurements, traffic information, weather conditions, and road accidents. In particular, this thesis aims to improve the efficiency of an existing framework for spatiotemporal data analysis by parallelizing most of the process stages using the Spark framework. The goal is to make the processing of large amounts of heterogeneous data, including US road accident data and weather condition data, more efficient in order to extract correlations between spatial and temporal events.
The framework involves several stages, including event deduplication, parent-child event correlation, final tree construction, and frequent pattern extraction
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
URI
![]() |
Modifica (riservato agli operatori) |
