polito.it
Politecnico di Torino (logo)

Bridge Aware Clustering with Noise Detection

Christian Paesante

Bridge Aware Clustering with Noise Detection.

Rel. Paolo Garza, Luca Cagliero. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2021

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview
[img] Archive (ZIP) (Documenti_allegati) - Other
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (643kB)
Abstract:

This work focuses on improving a density-based algorithm called Bridge Aware Clustering in terms of scalability, cluster fragmentation robustenss and noise ro-bustness. The scalability improvements were conducted by implementing a distributed version of Bridge Aware Clustering on Spark using Python.The cluster fragmentation robustness improvements were conducted by integrating a cluster fusion technique documented in the literature and testing it over different benchmark datasets. The noise robustness improvements were conducted by integrating a noise detection method through an extensive testing campaign aiming to evaluate the noise robustness over different levels of noise.

Relators: Paolo Garza, Luca Cagliero
Academic year: 2020/21
Publication type: Electronic
Number of Pages: 34
Subjects:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Aziende collaboratrici: UNSPECIFIED
URI: http://webthesis.biblio.polito.it/id/eprint/19179
Modify record (reserved for operators) Modify record (reserved for operators)