polito.it
Politecnico di Torino (logo)

Clustering feedback data with Logistic Network Lasso

Simone Desogus

Clustering feedback data with Logistic Network Lasso.

Rel. Barbara Caputo. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2020

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (3MB) | Preview
Abstract:

Feedback gathered from university courses' are the most effective tool given to students to shape lectures according to their needs and difficulties. Nonetheless, feedback is often neglected by both students and teachers, even if studies show how feedback can improve the quality of the student's experience as well as being an essential tool for the teacher's development. With the improvement in Natural Language Processing and the Machine Learning field, it is now possible to analyze text and effortlessly gather insights. These insights ease the professor's work when faced with the task of reading feedback which, depending on the course, can be thousands. The Logistic Network Lasso is a novel semi-supervised machine learning algorithm that has been applied in the binary classification scenario and compared to algorithms such as Maxflow and Belief Propagation, resulting in overall increased accuracy. This thesis aims to answer two research questions: Firstly, the performance of various standard clustering algorithms such as K-Means, Affinity Propagation, Spectral Clustering and DBSCAN, are compared using different Natural Language Processing techniques to encode university courses' feedback, showing which embedding techniques are better in terms of clustering feedback data. Secondly, the standard clustering algorithms are compared with the Logistic Network Lasso incorporating the best embedding techniques discovered during the first research question. Applying the Logistic Network Lasso as a clustering algorithm resulted in up to a 30% increase in the external indexes clustering performance metrics that were analyzed.

Relatori: Barbara Caputo
Anno accademico: 2019/20
Tipo di pubblicazione: Elettronica
Numero di pagine: 77
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Ente in cotutela: Aalto University (FINLANDIA)
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/14510
Modifica (riservato agli operatori) Modifica (riservato agli operatori)