Integrating Neural Processing Unit and Attention-based Architecture for Efficient Real-time Face Recognition in Industrial Environments

Davide Aiello

Integrating Neural Processing Unit and Attention-based Architecture for Efficient Real-time Face Recognition in Industrial Environments.

Rel. Luciano Lavagno, Ilario Gerlero, Marcello Babbi. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2024

Abstract

In recent years, the rise of Transformer models and attention mechanisms has significantly revolutionized the field of machine learning, particularly in computer vision and natural language processing. While attention-based architectures enable models to effectively prioritize and understand complex relationships within the data, deploying Transformer models on microcontrollers remains a challenge. Simultaneously, Neural Processing Units (NPUs) have become key hardware accelerators, designed specifically to optimize deep learning tasks. Their efficiency and low power consumption make them ideal for real-time applications in constrained environments. This study focuses on developing a complex deep learning pipeline that leverages attention-based models for real-time face recognition, targeting a cutting-edge microcontroller equipped with an NPU.

The work covers the complete development process of a deep learning system on edge, starting from the selection and potential design modification and training of the neural networks, followed by quantization, deployment and ultimately execution on the target hardware

Relatori

Luciano Lavagno, Ilario Gerlero, Marcello Babbi

Anno Accademico

2024/25

Tipo di pubblicazione

Elettronica

Numero di pagine

Informazioni aggiuntive

Tesi secretata. Fulltext non presente

Corso di laurea

Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)

Classe di laurea

Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA

Aziende collaboratrici

SENSOR REPLY S.R.L. CON UNICO SOCIO

URI

https://webthesis.biblio.polito.it/id/eprint/33094

Modifica (riservato agli operatori)