Enrico Porcelli
Evaluation of the impact of the Multi-Head Attention algorithm in Music Source Separation.
Rel. Eliana Pastor, Moreno La Quatra, Alkis Koudounas. Politecnico di Torino, Master of science program in Data Science And Engineering, 2024
|
Preview |
PDF (Tesi_di_laurea)
- Thesis
Licence: Creative Commons Attribution Non-commercial No Derivatives. Download (6MB) | Preview |
Abstract
This work focuses on the evaluation of the impact of the Multi-Head Attention algorithm in the field of Music Source Separation. In particular, our objective is to determine its potential to outperform the U-Net architecture often employed in state-of-the-art (SOTA) models. Additional primary goals include examining the repercussions of integrating Self-Supervised features into the pipeline and assessing the efficacy of the Attention mechanism for phase estimation. Notably, when utilizing the magnitude spectrogram as input, our model demonstrated promising outcomes, especially when using an increased volume of training data. The incorporation of Self-Supervised features into the model's architecture proved to be effective only when all layer representations are combined into a weighted sum.
Blindly concatenating the last layer appeared to be less beneficial to the model's performance
Relators
Academic year
Publication type
Number of Pages
Course of studies
Classe di laurea
URI
![]() |
Modify record (reserved for operators) |
