Davide Fassio
Relocalization of Autonomous Agents Using Monocular Depth Estimation on PTZ Cameras.
Rel. Giorgio Guglieri, Alessandro Minervini. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2025
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (26MB) | Preview |
Abstract
This thesis presents a systematic approach for calibrating a Pan-Tilt-Zoom network camera and integrating it with state-of-the-art object detection and depth estimation algorithms for distance measurement of autonomous agents. A calibration methodology was adopted to compute the intrinsic parameters of the camera across various zoom levels, addressing the non-linear variations induced by dynamic zoom adjustments. This calibration process is fundamental for establishing an accurate mapping between the three-dimensional world and the two-dimensional image plane of the camera. Subsequently, the study focuses on estimating the distance of autonomous agents by leveraging a dual-framework approach. Object detection is performed using YOLOv8, chosen for its balance between computational efficiency and detection accuracy in real-time applications.
For depth computation, a monocular image-based technique is employed using the Apple Depth Pro model, which incorporates a Vision Transformer architecture to capture high-level contextual features and infer depth from a single frame
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
URI
![]() |
Modifica (riservato agli operatori) |
