polito.it
Politecnico di Torino (logo)

Bridge Aware Clustering with Noise Detection

Christian Paesante

Bridge Aware Clustering with Noise Detection.

Rel. Paolo Garza, Luca Cagliero. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2021

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview
[img] Archive (ZIP) (Documenti_allegati) - Altro
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (643kB)
Abstract:

This work focuses on improving a density-based algorithm called Bridge Aware Clustering in terms of scalability, cluster fragmentation robustenss and noise ro-bustness. The scalability improvements were conducted by implementing a distributed version of Bridge Aware Clustering on Spark using Python.The cluster fragmentation robustness improvements were conducted by integrating a cluster fusion technique documented in the literature and testing it over different benchmark datasets. The noise robustness improvements were conducted by integrating a noise detection method through an extensive testing campaign aiming to evaluate the noise robustness over different levels of noise.

Relatori: Paolo Garza, Luca Cagliero
Anno accademico: 2020/21
Tipo di pubblicazione: Elettronica
Numero di pagine: 34
Soggetti:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/19179
Modifica (riservato agli operatori) Modifica (riservato agli operatori)