Real-time speech recognition using spiking neural networks

Bo Wang

Real-time speech recognition using spiking neural networks.

Rel. Stefano Di Carlo, Alessandro Savino, Alessio Carpegna. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Elettronica (Electronic Engineering), 2024

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (950kB) | Preview

Abstract

SNN (Spiking Neural Networks) has long been a popular research area in deep learning, combining neuroscience and machine learning to create models that most closely simulate the mechanisms of biological neurons for computation. However, the model is still in the exploratory research phase. This paper explores the use of SNN models to complete a real-world application—Real-time Speech Recognition using Spiking Neural Networks. The STM32 MEMS microphone is used as the sound input in this study, and MFCC is applied to process the audio data, which is then converted into spike encoding for SNN training. The trained model is deployed on the PYNQ board for real-time speech recognition.

Testing showed that the model achieved a recognition accuracy of up to 96.25%