Riccardo Prestigiacomo
ADVANCEMENTS IN TOPIC MODELING TECHNIQUES: A COMPREHENSIVE STUDY ON ALGORITHM COMPARISON, AND NOVEL METRICS FOR OUTCOME EVALUATION, USING SOCIAL MEDIA DATA.
Rel. Paolo Garza. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2023
Abstract
This master's thesis, conducted in the company Claravista, a consulting firm based in Paris specializing in digital marketing and data science, aims to comprehensively compare various topic modeling algorithms. The primary focus of this research is on natural language processing, specifically delving into topic modeling techniques. In an era where a staggering volume of information is generated daily by individuals across social media platforms, the potential inherent in analyzing this wealth of data resonates profoundly across numerous domains. Especially in the fast-paced world of digital marketing, where Claravista works, this kind of analysis becomes even more important. Throughout this study, three distinct machine learning algorithms—namely, Latent Dirichlet Allocation (LDA), Top2Vec, and BERTopic—are evaluated in the context of topic modeling.
The primary data source for this analysis is social media content
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Informazioni aggiuntive
Corso di laurea
Classe di laurea
Ente in cotutela
URI
![]() |
Modifica (riservato agli operatori) |
