polito.it
Politecnico di Torino (logo)

Exploring incrementality in data subgroups for speech models

Leonardo Moraglia

Exploring incrementality in data subgroups for speech models.

Rel. Eliana Pastor, Alkis Koudounas. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2025

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (8MB) | Preview
Abstract:

This dissertation addresses the analysis of performance disparities between subgroups of data within speech recognition models. More specifically, particular focus will be put on how the performance of subgroups evolves during the training of the models themselves. Having considered previous studies that have highlighted the presence of subgroups and discriminatory biases in speech recognition models, this work will also focus on understanding how these disparities are created and propagated during training. For instance, populations characterized by parameters such as gender, accent, speech rate, or age may undergo either a decline or enhancement in the model's performance as it develops. In conclusion, the detailed examination of the evolution of these disparities across diverse datasets and speech recognition models will be conducted to acquire an in-depth understanding of their propagation. This analysis aims to contribute to the advancement of these technologies towards enhanced fairness and accessibility in the future.

Relatori: Eliana Pastor, Alkis Koudounas
Anno accademico: 2024/25
Tipo di pubblicazione: Elettronica
Numero di pagine: 78
Soggetti:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/35382
Modifica (riservato agli operatori) Modifica (riservato agli operatori)