polito.it
Politecnico di Torino (logo)

Deep neural networks for language recognition

Axel Baron

Deep neural networks for language recognition.

Rel. Sandro Cumani. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB) | Preview
Abstract:

This work belongs in the field of Language Recognition, which ultimately aims to deter- mine the language of a given speech sample. This work mostly focuses on the study of acoustic feature extraction and of Deep neural network technologies in order to solve language recognition problems. In particular, the Log-mel extraction technique and the ECAPA-TDNN model are the main technologies that drew my attention and motivated this work. This document begins by introducing the concepts and history of Language Recog- nition. Then I explain what acoustic features are, their purpose and the specific case of log-mel features. To follow there is a report about Neural Networks which slides towards the more complex case of Deep neural network and the case of ECAPA-TDNN model. In the end, there is my experimental setup, the decisions I made to treat this subject as well as the analysis of my results.

Relatori: Sandro Cumani
Anno accademico: 2022/23
Tipo di pubblicazione: Elettronica
Numero di pagine: 49
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/26761
Modifica (riservato agli operatori) Modifica (riservato agli operatori)