Christian Paesante
Bridge Aware Clustering with Noise Detection.
Rel. Paolo Garza, Luca Cagliero. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2021
|
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (1MB) | Preview |
|
Archive (ZIP) (Documenti_allegati)
- Other
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (643kB) |
Abstract: |
This work focuses on improving a density-based algorithm called Bridge Aware Clustering in terms of scalability, cluster fragmentation robustenss and noise ro-bustness. The scalability improvements were conducted by implementing a distributed version of Bridge Aware Clustering on Spark using Python.The cluster fragmentation robustness improvements were conducted by integrating a cluster fusion technique documented in the literature and testing it over different benchmark datasets. The noise robustness improvements were conducted by integrating a noise detection method through an extensive testing campaign aiming to evaluate the noise robustness over different levels of noise. |
---|---|
Relators: | Paolo Garza, Luca Cagliero |
Academic year: | 2020/21 |
Publication type: | Electronic |
Number of Pages: | 34 |
Subjects: | |
Corso di laurea: | Corso di laurea magistrale in Data Science And Engineering |
Classe di laurea: | New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING |
Aziende collaboratrici: | UNSPECIFIED |
URI: | http://webthesis.biblio.polito.it/id/eprint/19179 |
Modify record (reserved for operators) |