Antonio De Cinque
Towards Temporal Consistency in Egocentric Object Detection for Open-Vocabulary Navigation.
Rel. Giuseppe Bruno Averta, Claudia Cuttano, Gabriele Tiboni, Marco Ciccone. Politecnico di Torino, Master of science program in Computer Engineering, 2024
|
Preview |
PDF (Tesi_di_laurea)
- Thesis
Licence: Creative Commons Attribution Non-commercial No Derivatives. Download (87MB) | Preview |
Abstract
Egocentric object detection is a critical aspect of robotic navigation and interaction within dynamic and complex home environments. The primary objective of this research is to explore the challenges and solutions associated with achieving temporal consistency in egocentric object detection, particularly in the scope of the Open-Vocabulary Mobile Manipulation (OVMM) challenge. This is contextualized within the HomeRobot 3D simulation environment, where a robot (Hello Robot Stretch) is tasked with navigating a household and bring an object from one place to another. The perception module of the robot is enabled by open-vocabulary object detection models, such as DETIC (Detecting Twenty-thousand Classes using Image-level Supervision).
These models have shown to be promising in recognizing a wide range of objects, given any prompt
Relators
Academic year
Publication type
Number of Pages
Course of studies
Classe di laurea
URI
![]() |
Modify record (reserved for operators) |
