Davide Lezzoche
Design and Optimization of a Winograd Aware IP for Quantized Neural Networks.
Rel. Claudio Passerone, Pierpaolo Mori'. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Elettronica (Electronic Engineering), 2022
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (7MB) | Preview |
Abstract
Convolutional Neural Networks (CNNs) are increasingly being used in the fieldof deep learning. Among their possible applications are computer vision, speech recognition, and image classification. Nowadays, CNNs have reached very high levels of precision, at the cost of an huge amount of multiplications to perform and parameters to store. The most widely used platforms to accelerate CNNs are GPUs, which are characterized by excellent computing performance. However, their excessive power consumption does not make them the best choice for embedded applications. Instead, FPGAs represent a good compromise between throughput, flexibility, reconfigurability, and energy efficiency. Given the limited resources of FPGAs and the large computational cost and storage demand (both on-chip and off-chip) of CNNs, several optimizations are required to implement an FPGA-based CNN accelerator.
Quantization, loop unrolling, and data vectorization help reduce resource demand and speed up the inference
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
URI
![]() |
Modifica (riservato agli operatori) |
