Energy-Efficient Quality Adaptation for Recurrent Neural Networks

Francesco Panini

Energy-Efficient Quality Adaptation for Recurrent Neural Networks.

Rel. Massimo Poncino. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2018

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (4MB) | Preview

Abstract

Recurrent Neural Networks - RNNs are state-of-the-art models able to deliver very high accuracy in sequence modeling and machine translation tasks. In particu- lar the Encoder-Decoder architecture excels in sequence-to-sequence tasks in which input and output sequences may not have the same length. These networks work in two stages, at first the input sequence is encoded in a fixed length representa- tion, which is then decoded in order to produce a new target sequence. Due to the abundance of the network parameters, performing inference using these models requires a high computing power and results in large energy consumption, typically unsustainable for an embedded device.

While executing the inference on edge nodes is beneficial in terms of latency and responsiveness of the system, generally such nodes do not have the hardware resources needed to sustain the heavy computa- tions involved

Relatori

Massimo Poncino

Anno Accademico

2018/19

Tipo di pubblicazione

Elettronica

Numero di pagine

Corso di laurea

Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)

Classe di laurea

Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA

URI

https://webthesis.biblio.polito.it/id/eprint/8990

Modifica (riservato agli operatori)