Politecnico di Torino (logo)

Relaxing the Forget Constraints in Open World Recognition

Mauro Musarra

Relaxing the Forget Constraints in Open World Recognition.

Rel. Barbara Caputo, Dario Fontanel, Fabio Cermelli. Politecnico di Torino, Corso di laurea magistrale in Data Science and Engineering, 2022


Vision is a fundamental ability for robots to operate in the real world. In the last few years deep neural networks has significantly improved the state of the art of robotic vision. However, they are mainly trained to recognize only the categories provided in the training set (closed world assumption), being ill equipped to operate in the real world, where new unknown objects may appear over time. In this work, we investigate the open world recognition (OWR) problem that presents two challenges: (i) learn new concepts over time (incremental learning) and (ii) discern between known and unknown categories (open set recognition). Current state-of-the-art OWR methods address incremental learning by employing a knowledge distillation loss. It forces the model to keep the same predictions across training steps, in order to maintain the acquired knowledge. This behaviour may induce the model in mimicking uncertain predictions, preventing it from reaching an optimal representation on the new classes. To overcome this limitation, we propose the Poly loss that penalizes less the changes in the predictions for uncertain samples, while forcing the same output on confident ones. Moreover, we introduce a forget constraint relaxation strategy that allows the model to obtain a better representation of new classes by randomly zeroing the contribution of some old classes from the distillation loss. Finally, while current methods rely on metric learning to detect unknown samples, we propose a new rejection strategy that sidesteps it and directly uses the model classifier to estimate if a sample is known or not. Experiments on three datasets demonstrate that our method outperforms the state of the art.

Relators: Barbara Caputo, Dario Fontanel, Fabio Cermelli
Academic year: 2021/22
Publication type: Electronic
Number of Pages: 72
Additional Information: Tesi secretata. Fulltext non presente
Corso di laurea: Corso di laurea magistrale in Data Science and Engineering
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Aziende collaboratrici: UNSPECIFIED
URI: http://webthesis.biblio.polito.it/id/eprint/22655
Modify record (reserved for operators) Modify record (reserved for operators)