polito.it
Politecnico di Torino (logo)

Transformer-based pre-trained models for Out-Of-Distribution detection

Giulia D'Ascenzi

Transformer-based pre-trained models for Out-Of-Distribution detection.

Rel. Tatiana Tommasi, Francesco Cappio Borlino. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023

Abstract:

Standard closed-set deep learning approaches deployed in the real-world fail when test samples come from never-seen data distributions, which makes them untrustworthy for safety-critical applications such as autonomous driving or healthcare. A preferable behavior would be to raise an alert in case of unknown object categories, a task known as Out-Of-Distribution (OOD) detection. The common strategy to handle this task is to train (or at least fine-tune) the detector on the target data to make it learn the normality distribution, to then recognize test samples not belonging to it. Consequently, It would be necessary to gather a great amount of labeled target data and to perform the training procedure for each distinct downstream task. Those are restrictive characteristics for many real-world applications, due to data privacy rules, strict memory, and computational constraints (e.g edge computing). Two recently proposed strategies suggest tackling the problem by using only a pre-trained model: one relies on visual relational reasoning, while the other exploits vision and language. Both those methods focus on learning representations generic enough to enable the detection of out-of-distribution samples in multiple tasks, without the need for fine-tuning on the target data. This thesis presents a comparison of the two strategies and proposes a tailored transformer architecture that increases the explainability of the overall model.

Relatori: Tatiana Tommasi, Francesco Cappio Borlino
Anno accademico: 2022/23
Tipo di pubblicazione: Elettronica
Numero di pagine: 78
Informazioni aggiuntive: Tesi secretata. Fulltext non presente
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/26822
Modifica (riservato agli operatori) Modifica (riservato agli operatori)