Giorgio Orioles
Combining link keys and similarity-based approaches to data interlinking.
Rel. Paolo Garza. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023
|
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (1MB) | Preview |
Abstract: |
Data interlinking plays a crucial role in expanding and enriching linked open data. One approach to address data interlinking involves the utilization of link keys, which extends the concept of keys to scenarios where two RDF datasets are described using distinct ontologies. An alternative approach employs various similarities measures to evaluate the proximity of instances and decide if they have to be linked. In this work, we will employ numerical link specifications and document similarity. Both approaches have their respective advantages and limitations, and none of them provide a comprehensive solution for all interlinking challenges. Hence, this master thesis considers ways to combine such approaches elegantly, using the combination operators disjunction and injective complementation. We will make use of pairs of datasets to perform experiments in order to analyze the performance of different combination methods. The results show that the combination of the two approaches was productive, although it is very depending on the task we are analyzing. |
---|---|
Relators: | Paolo Garza |
Academic year: | 2023/24 |
Publication type: | Electronic |
Number of Pages: | 76 |
Subjects: | |
Corso di laurea: | Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering) |
Classe di laurea: | New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING |
Ente in cotutela: | INSTITUT NATIONAL POLYTECHNIQUE DE GRENOBLE (INPG) - ENSIMAG (FRANCIA) |
Aziende collaboratrici: | INRIA |
URI: | http://webthesis.biblio.polito.it/id/eprint/28476 |
Modify record (reserved for operators) |