Bo Wang
Real-time speech recognition using spiking neural networks.
Rel. Stefano Di Carlo, Alessandro Savino, Alessio Carpegna. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Elettronica (Electronic Engineering), 2024
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (950kB) | Preview |
Abstract
SNN (Spiking Neural Networks) has long been a popular research area in deep learning, combining neuroscience and machine learning to create models that most closely simulate the mechanisms of biological neurons for computation. However, the model is still in the exploratory research phase. This paper explores the use of SNN models to complete a real-world application—Real-time Speech Recognition using Spiking Neural Networks. The STM32 MEMS microphone is used as the sound input in this study, and MFCC is applied to process the audio data, which is then converted into spike encoding for SNN training. The trained model is deployed on the PYNQ board for real-time speech recognition.
Testing showed that the model achieved a recognition accuracy of up to 96.25%
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
