Carlo Orientale Caputo
Plasticity across neural hierarchies in artificial neural network.
Rel. Andrea Pagnani, Matteo Marsili. Politecnico di Torino, Corso di laurea magistrale in Physics Of Complex Systems (Fisica Dei Sistemi Complessi), 2023
|
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (2MB) | Preview |
Abstract: |
Deep neural networks can extract a hierarchy of relevant features from the data that can be used both for classification and generation task, reaching state-of-the-art performance in object/speech recognition and language translation. However, many characteristics of the way these networks process the information or, more in general, the reason why they work so well are still unclear. In this work we analyze some features of a deep belief network during training across different layers in an unsupervised setting. First of all we study how the plasticity varies across the network’s layers, computing the variation of the architecture’s weights when the dataset to be learned is changed. We observe an increasing behaviour of the plasticity across layers, meaning that the features learned in deep layers are more dataset dependent, instead the shallow ones are more generic. Then we analyze some features of the internal representation (i.e. the probability distribution) of the hidden layers, finding that shallow layers are well described by a pairwise model, while in deep layers, higher order interactions seem to be more present. This could be related with the hierarchical extraction of features performed by the network. Finally we observe that the representations across the layers become close to the hierarchical feature model, that is a theoretical model consistent with the principles of maximal a priori ignorance and of maximal relevance, describing the internal representation of a learning machine. |
---|---|
Relatori: | Andrea Pagnani, Matteo Marsili |
Anno accademico: | 2022/23 |
Tipo di pubblicazione: | Elettronica |
Numero di pagine: | 18 |
Soggetti: | |
Corso di laurea: | Corso di laurea magistrale in Physics Of Complex Systems (Fisica Dei Sistemi Complessi) |
Classe di laurea: | Nuovo ordinamento > Laurea magistrale > LM-44 - MODELLISTICA MATEMATICO-FISICA PER L'INGEGNERIA |
Aziende collaboratrici: | ICTP |
URI: | http://webthesis.biblio.polito.it/id/eprint/27940 |
Modifica (riservato agli operatori) |