Christian D'Alleva
Multimodal Transformer for Spatio-Temporal Urban Air Quality Forecasting: An Ablation Study over Milan.
Rel. Andrea Bottino, Luca Barco, Lorenzo Innocenti, Giacomo Blanco. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2026
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (6MB) | Preview |
Abstract
Multimodal learning has become increasingly relevant in environmental modelling, where heterogeneous sources such as satellite observations and meteorological measurements are jointly used to estimate urban air pollutant concentrations. Despite encouraging predictive performance, it remains unclear which modalities and temporal components truly contribute useful information, and which instead introduce redundancy or noise. Understanding these relationships is essential for designing efficient and interpretable models rather than progressively more complex architectures. This thesis proposes a multimodal framework for urban air quality forecasting over Milan. The approach integrates multi-sensor satellite imagery (Sentinel-1, Sentinel-2, Sentinel-3, and Sentinel-5), land-cover data, and a digital elevation model together with meteorological observations.
These data are fused within a self-supervised pretraining strategy based on an adaptation of the VideoMAE architecture, enabling the extraction of representative spatio-temporal features describing air quality dynamics
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
