polito.it
Politecnico di Torino (logo)

Unsupervised Cross-Domain Feature Extraction Using Autoencoders for Analyzing Single-Cell Images from Acute Myeloid Leukemia Patients

Raheleh Salehi

Unsupervised Cross-Domain Feature Extraction Using Autoencoders for Analyzing Single-Cell Images from Acute Myeloid Leukemia Patients.

Rel. Tatiana Tommasi. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (6MB) | Preview
Abstract:

Identification and classification of white blood cells in peripheral blood smears is a key step for the diagnosis of hematological malignancies. Different lab procedures, illumination, staining, and microscope settings are some of the factors resulting in domain shifts which hampers the applicability and reusability of machine learning methods when applied to data collected from different sites. In this thesis, we propose an autoencoder to extract unsupervised cross-domain features on three different datasets of single white blood cells. Using a Mask R-CNN architecture as a first step allows the autoencoder to focus on the relevant white blood cell and eliminate artifacts in the image. We use a simple random forest method to classify the extracted features of the single cells as a way to evaluate the quality of the features extracted by the autoencoder. We show that the random forest classifier trained on only one of the datasets, can perform satisfactorily on the unseen datasets thanks to the rich features extracted by the autoencoder, and in the cross-domain task, it outperforms the published oracle models. According to the results, this unsupervised approach can be employed in more complicated diagnosis and prognosis tasks without the need to add expensive expert labels to unobserved datasets and it is expected that it will improve the applicability and reliability of these systems in different centers and hospitals.

Relatori: Tatiana Tommasi
Anno accademico: 2022/23
Tipo di pubblicazione: Elettronica
Numero di pagine: 40
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: Helmholtz Zentrum München
URI: http://webthesis.biblio.polito.it/id/eprint/26885
Modifica (riservato agli operatori) Modifica (riservato agli operatori)