polito.it
Politecnico di Torino (logo)

Combining link keys and similarity-based approaches to data interlinking

Giorgio Orioles

Combining link keys and similarity-based approaches to data interlinking.

Rel. Paolo Garza. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB) | Preview
Abstract:

Data interlinking plays a crucial role in expanding and enriching linked open data. One approach to address data interlinking involves the utilization of link keys, which extends the concept of keys to scenarios where two RDF datasets are described using distinct ontologies. An alternative approach employs various similarities measures to evaluate the proximity of instances and decide if they have to be linked. In this work, we will employ numerical link specifications and document similarity. Both approaches have their respective advantages and limitations, and none of them provide a comprehensive solution for all interlinking challenges. Hence, this master thesis considers ways to combine such approaches elegantly, using the combination operators disjunction and injective complementation. We will make use of pairs of datasets to perform experiments in order to analyze the performance of different combination methods. The results show that the combination of the two approaches was productive, although it is very depending on the task we are analyzing.

Relators: Paolo Garza
Academic year: 2023/24
Publication type: Electronic
Number of Pages: 76
Subjects:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Ente in cotutela: INSTITUT NATIONAL POLYTECHNIQUE DE GRENOBLE (INPG) - ENSIMAG (FRANCIA)
Aziende collaboratrici: INRIA
URI: http://webthesis.biblio.polito.it/id/eprint/28476
Modify record (reserved for operators) Modify record (reserved for operators)