Davide Aiello
Integrating Neural Processing Unit and Attention-based Architecture for Efficient Real-time Face Recognition in Industrial Environments.
Rel. Luciano Lavagno, Ilario Gerlero, Marcello Babbi. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2024
Abstract
In recent years, the rise of Transformer models and attention mechanisms has significantly revolutionized the field of machine learning, particularly in computer vision and natural language processing. While attention-based architectures enable models to effectively prioritize and understand complex relationships within the data, deploying Transformer models on microcontrollers remains a challenge. Simultaneously, Neural Processing Units (NPUs) have become key hardware accelerators, designed specifically to optimize deep learning tasks. Their efficiency and low power consumption make them ideal for real-time applications in constrained environments. This study focuses on developing a complex deep learning pipeline that leverages attention-based models for real-time face recognition, targeting a cutting-edge microcontroller equipped with an NPU.
The work covers the complete development process of a deep learning system on edge, starting from the selection and potential design modification and training of the neural networks, followed by quantization, deployment and ultimately execution on the target hardware
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Informazioni aggiuntive
Corso di laurea
Classe di laurea
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
