Politecnico di Torino (logo)

Semi-supervised classification in the Censored Block Model

Filippo Zimmaro

Semi-supervised classification in the Censored Block Model.

Rel. Alfredo Braunstein, Romain Couillet, Lorenzo Dell'Amico. Politecnico di Torino, Corso di laurea magistrale in Physics Of Complex Systems (Fisica Dei Sistemi Complessi), 2021

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (781kB) | Preview

Semi-supervised learning is characterized by a partially labelled dataset, in the sense that we know the true labels, i.e. classes, of a usually small fraction of the datapoints. Optimally embedding this "semi-supervised information" in the learning process is a hard task, that can be carried out through the development of specific algorithms and an efficient graph construction. Here we focus more on the latter point: restricted to the semi-supervised version of a paradigmatic model for the branch of the statistical physics that deals with AI concepts, namely the Censored Block Model (or Planted Spin Glass), we propose various graph constructions and give them theoretical justifications. After having developed those configurations, we apply on them two of the simplest algorithms of graph-based inference, namely the Naive Mean Field and the one based on the Adjacency matrix. We study specifically the latter and discover an inefficiency in the learning process strongly similar to the one found and fixed by Mai and Couillet in their work on Laplacian Regularization. In the same way, we provide the algorithm of a further parameter that controls how much the learning process relies on the "semi-supervised information" and eventually try to optimize it. The study sets the basis for an extension to the semi-supervised case of the algorithms recently developed by the statistical physics community, based on the spectral properties of the Non-Backtracking and the Bethe-Hessian matrices.

Relators: Alfredo Braunstein, Romain Couillet, Lorenzo Dell'Amico
Academic year: 2021/22
Publication type: Electronic
Number of Pages: 32
Corso di laurea: Corso di laurea magistrale in Physics Of Complex Systems (Fisica Dei Sistemi Complessi)
Classe di laurea: New organization > Master science > LM-44 - MATHEMATICAL MODELLING FOR ENGINEERING
Ente in cotutela: Université Grenoble-Alpes, GIPSA-lab, pôle GAIA (équipe Apprentissage) (FRANCIA)
Aziende collaboratrici: Université Grenoble Alpes (UGA)
URI: http://webthesis.biblio.polito.it/id/eprint/20443
Modify record (reserved for operators) Modify record (reserved for operators)