A data analytics integrative approach for multi-omics clustering in leukemia samples

Stefano Nardella

A data analytics integrative approach for multi-omics clustering in leukemia samples.

Rel. Elisa Ficarra, Marta Lovino. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2021

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (1MB) | Preview

Abstract

In the last decades, the decrease in the cost of next-generation sequencing (NGS) technologies has allowed the widespread of many omics data (e.g., transcriptomics, genomics). This thesis focuses on a multi-omics approach to cluster patients so that similar ones are assigned to the same cluster, simultaneously considering all data sources. The proposed method has been evaluated on patients affected by myeloid and lymphoid leukemias (AML, ALL). The method considers two types of transcriptomics data, miRNA and mRNA expression. The expression measures the quantity of the molecule in the sample, which is crucial in regulating transcriptional and post-transcriptional processes. Many techniques based on multi-omics clustering of samples are presented.

Among them, tools based on joint dimensionality reduction techniques -jDR- (e.g., JIVE and GCCA) should be mentioned