Deep Neural Networks for Speaker Verfication

Salvatore Sarni

Deep Neural Networks for Speaker Verfication.

Rel. Sandro Cumani. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2020

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (1MB) | Preview

Abstract

Speaker identification and speaker verification are the main tasks in the field of speaker recognition. The former involves inferring the speaker of an utterance from a set of possible identities, whereas the latter aims at assessing whether a claimed identity corresponds to the speaker of a given speech segment. Thanks to the advances in the field of Deep Learning, Deep Neural Networks (DNN) have recently become the state-of-the-art technique for utterance representation in the speaker recognition field. The DNN approach consists in training a neural network to extract speaker embeddings, i.e. fixed dimensional utterance representations that contain speaker-discriminant information. DNN embeddings significantly outperform previous state-of-the-art methods such as i-vectors in terms of verification accuracy.

One of the most effective architectures for speaker embedding extraction is the Time Delay Neural Network (TDNN), which is able to model long range temporal dependencies

Relatori

Sandro Cumani

Anno Accademico

2019/20

Tipo di pubblicazione

Elettronica

Numero di pagine

Corso di laurea

Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)

Classe di laurea

Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA

URI

https://webthesis.biblio.polito.it/id/eprint/15245

Modifica (riservato agli operatori)