Vittorio Pipoli
Tranformer-based architectures for long biological sequences.
Rel. Elena Maria Baralis, Elisa Ficarra, Marta Lovino, Giuseppe Attanasio. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2022
Abstract
Gene expression is the process by which the information encoded in the genes is transcribed and translated into a functional product, allowing cells to react to external impulses and carry out their main functions. Hence, gene expression has primary importance in life, and it is not difficult to imagine that fully understanding such a phenomenon may help cancer diagnosis and drug discovery. State-of-the-art Deep Learning techniques such as Xpresso and Expecto focus on gene expression level prediction by analyzing the raw DNA sequences (tens of thousands of base-pairs long) of each gene extracted from the reference genome, exploiting Convolutional layers. Learning from long sequences is challenging due to the intrinsic nature of the DNA.
Indeed, such models must learn to extract local patterns and long-range dependencies
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Informazioni aggiuntive
Corso di laurea
Classe di laurea
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
