Training and inference of fully connected networks with resistive memories: design and optimization of multiple conductance-based structure, and of a novel architecture to implement an arbitrary activation function

Giorgio Cristiano

Training and inference of fully connected networks with resistive memories: design and optimization of multiple conductance-based structure, and of a novel architecture to implement an arbitrary activation function.

Rel. Carlo Ricciardi, Candido Pirri. Politecnico di Torino, Corso di laurea magistrale in Nanotechnologies For Icts (Nanotecnologie Per Le Ict), 2018

Abstract

Artificial intelligence now matches or even exceeds human performance in tasks such as classification and recognition, and it is therefore increasingly used in everyday life. As of today, however, the underlying neural networks run on non-dedicated hardware such as CPUs or GPUs, and to achieve these results such systems consume far more power, time and area than necessary. Considerable effort has therefore gone into developing dedicated hardware, such as the TPU, a digital accelerator developed by Google, to drastically reduce the resources used. Such accelerators still follow a von Neumann architecture, however, and the need to move data from the memory to a processor remains a severely limiting factor.

A possible solution is to encode the synaptic weights in analog memories and perform the operations directly at the location of the data