polito.it
Politecnico di Torino (logo)

Optimizing Image Retrieval for Robust Visual Localization.

Pablo Torasso

Optimizing Image Retrieval for Robust Visual Localization.

Rel. Carlo Masone, Gabriele Trivigno, Gabriele Moreno Berton. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2025

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (18MB) | Preview
Abstract:

Visual localization, the process of determining a camera’s exact 6-DoF pose within a known environment, is fundamental to applications such as autonomous vehicles, augmented reality, and robotics. Traditional methods like GNSS offer only coarse position estimates and are often unreliable in indoor or visually challenging scenarios, underscoring the need for alternative, high-precision approaches. This thesis addresses these challenges by focusing on the image retrieval component within the visual localization pipeline—a critical stage that narrows the search space for computationally expensive local feature matching and thereby enhances overall system efficiency and accuracy. The research systematically evaluates state-of-the-art image retrieval models, including NetVLAD, AP-GeM, and SALAD, under diverse environmental conditions such as varying illumination, seasonal changes, and significant viewpoint differences. A key aspect of this study is the analysis of image angular diversity and its impact on retrieval performance, which reveals the sensitivity of current methods to changes in camera orientation. Additionally, the integration of local feature descriptors with global image representations is explored to further improve discrimination between similar scenes and reduce false positives. Through comprehensive experiments and performance benchmarks, the thesis develops optimization strategies that enhance both the scalability and performances of visual localization systems. The results provide practical guidelines for deploying advanced image retrieval techniques in large-scale, real-world environments, thereby advancing the state-of-the-art in visual localization technology.

Relatori: Carlo Masone, Gabriele Trivigno, Gabriele Moreno Berton
Anno accademico: 2024/25
Tipo di pubblicazione: Elettronica
Numero di pagine: 85
Soggetti:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/35410
Modifica (riservato agli operatori) Modifica (riservato agli operatori)