Gabriele Cirotto
Evaluating the Impact of AI-Generated Data on Training Keyword Spotting Models.
Rel. Andrea Calimera, Valentino Peluso. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2024
PDF (Tesi_di_laurea)
- Tesi
Accesso riservato a: Solo utenti staff fino al 31 Ottobre 2025 (data di embargo). Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (4MB) |
Abstract: |
Keyword Spotting (KWS) systems have become common in everyday applications, used in virtual assistants (i.e. Amazon’s Alexa) and voice-controlled devices. These systems are typically part of complex architectures designed to simplify daily tasks and they work by continuously monitoring audio input for specific "wake words", such as "Hey Siri" or "Alexa", triggering an action or response when those words are detected through recognition models. A key challenge in designing KWS systems is the data collection process, which is often resource-consuming, especially when employing deep learning models since they require many high-quality recordings for effective training. The thesis examined the impact of blending well-known KWS datasets with synthetic samples generated by modern Text-To-Speech (TTS) systems. The objective was to determine whether integrating synthetic data could reduce the resources required for dataset construction while maintaining model performance. Several hybrid datasets were built, combining original KWS datasets with synthetic speech, and were then used to train state-of-the-art KWS models. Finally, the results were compared to models trained solely on original data. The experiments revealed a performance drop when synthetic data was introduced, with the decrease becoming more evident as the number of synthetic samples increased. This result highlights the need for particular care when using synthetic data to ensure the quality of KWS models is not compromised. |
---|---|
Relatori: | Andrea Calimera, Valentino Peluso |
Anno accademico: | 2024/25 |
Tipo di pubblicazione: | Elettronica |
Numero di pagine: | 61 |
Soggetti: | |
Corso di laurea: | Corso di laurea magistrale in Data Science And Engineering |
Classe di laurea: | Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA |
Aziende collaboratrici: | Politecnico di Torino |
URI: | http://webthesis.biblio.polito.it/id/eprint/33237 |
Modifica (riservato agli operatori) |