Deep Domain Adaptation through Inter-modal Self-supervision

Luca Robbiano

Deep Domain Adaptation through Inter-modal Self-supervision.

Supervisors: Barbara Caputo, Mirco Planamente, Mohammadreza Loghmani. Politecnico di Torino, Master's degree programme in Computer Engineering (Corso di laurea magistrale in Ingegneria Informatica), 2020

PDF (Tesi_di_laurea) - Thesis
License: Creative Commons Attribution Non-commercial No Derivatives.

Abstract:

Computer vision in robotics relies heavily on RGB-D data. However, collecting large, manually annotated datasets is extremely time-consuming and therefore costly. A potential solution is to generate synthetic datasets automatically and use them to make predictions on real data. Nevertheless, the domain shift between the synthetic dataset (source domain) and the real data (target domain) limits the effectiveness of this solution, yielding an accuracy significantly lower than the one that would be obtained using labelled real data. To overcome this issue, multiple domain adaptation methods have been developed. These methods can also be employed in a multimodal scenario such as RGB-D, but none of them exploits the relationship between the modalities. We propose a novel domain adaptation method that reduces the domain shift by forcing the convolutional neural network to learn the connection between RGB and depth images through a secondary self-supervised task. Extensive experiments on object categorization show that exploiting the inter-modal relation can significantly enhance performance on the main classification task.
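
The abstract does not say which self-supervised task is used, so the following is only a toy sketch of how an inter-modal pretext task can produce labels without annotation. It assumes a hypothetical task (predicting the relative rotation between an RGB image and its paired depth image) and a hypothetical weighted sum of losses; the names `rot90`, `make_pretext_sample`, `total_loss`, and `weight` are illustrative and are not taken from the thesis.

```python
import random


def rot90(img, k):
    """Rotate a 2D grid (list of rows) clockwise by k * 90 degrees."""
    img = [list(row) for row in img]  # work on a copy
    for _ in range(k % 4):
        # Reversing the rows and transposing gives a clockwise quarter turn.
        img = [list(row) for row in zip(*img[::-1])]
    return img


def make_pretext_sample(rgb, depth, rng):
    """Rotate each modality independently and label the pair.

    ASSUMPTION: relative rotation is a stand-in pretext task. The label
    needs no class annotation, so it can be computed for both synthetic
    (source) and real (target) RGB-D pairs.
    """
    k_rgb = rng.randrange(4)
    k_depth = rng.randrange(4)
    label = (k_depth - k_rgb) % 4  # one of four relative-rotation classes
    return rot90(rgb, k_rgb), rot90(depth, k_depth), label


def total_loss(cls_loss_source, pretext_loss_source, pretext_loss_target, weight=1.0):
    """ASSUMPTION: main classification loss plus a weighted pretext loss.

    Only the source domain contributes a classification term because only
    source images carry class labels.
    """
    return cls_loss_source + weight * (pretext_loss_source + pretext_loss_target)


if __name__ == "__main__":
    rng = random.Random(0)
    rgb = [[1, 2], [3, 4]]      # toy single-channel "image"
    depth = [[5, 6], [7, 8]]    # paired toy depth map
    rgb_rot, depth_rot, label = make_pretext_sample(rgb, depth, rng)
    print("relative rotation class:", label)
```

In a real pipeline the rotated pair would be fed to a network with two heads, one for object classes and one for the pretext label; the sketch covers only label generation and loss combination.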

Supervisors: Barbara Caputo, Mirco Planamente, Mohammadreza Loghmani
Academic year: 2019/20
Publication type: Electronic
Number of pages: 61
Subjects:
Degree programme: Master's degree programme in Computer Engineering (Corso di laurea magistrale in Ingegneria Informatica)
Degree class: New organization > Master of Science > LM-32 - COMPUTER ENGINEERING (INGEGNERIA INFORMATICA)
Collaborating companies: NOT SPECIFIED
URI: http://webthesis.biblio.polito.it/id/eprint/14498