polito.it
Politecnico di Torino (logo)

Understanding and Enhancing Visual Place Recognition through Embedding Space Interpretability and Uncertainty Estimation

Davide Sferrazza

Understanding and Enhancing Visual Place Recognition through Embedding Space Interpretability and Uncertainty Estimation.

Rel. Carlo Masone, Gabriele Moreno Berton, Gabriele Trivigno. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2025

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (22MB) | Preview
Abstract:

Visual Place Recognition (VPR) involves determining the geographic location of a photo based solely on its visual content. Recent advancements in Deep Learning (DL) have enabled the representation of images in high-dimensional spaces, where photos taken in the same location tend to cluster together, while images from different places are spread apart. This spatial organization makes it easier to predict locations by performing similarity searches against a database of known places. However, a key gap in current research is understanding the specific information retained in these image embeddings that allows for effective and reliable location prediction. Additionally, existing State-of-the-Art (SOTA) deterministic methods in VPR are unable to quantify the uncertainty of their predictions. This is particularly problematic in safety-critical applications, such as autonomous driving, where knowing the confidence level in a system's decision is vital for ensuring safety. This thesis addresses two main challenges: first, understanding and visualizing the essential information encoded in image embeddings, and second, providing uncertainty estimates for VPR models through post-hoc techniques. To overcome these challenges, the thesis employs Generative Artificial Intelligence models, particularly Latent Diffusion Models, to explore and visualize the content within image embeddings. Additionally, uncertainty estimation methods are incorporated to enhance the robustness and reliability of VPR systems. The contributions of this thesis provide valuable insights into the interpretability and reliability of VPR systems, offering a framework for analyzing the output of these models and incorporating uncertainty quantification during inference.

Relatori: Carlo Masone, Gabriele Moreno Berton, Gabriele Trivigno
Anno accademico: 2024/25
Tipo di pubblicazione: Elettronica
Numero di pagine: 141
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/35407
Modifica (riservato agli operatori) Modifica (riservato agli operatori)