Fabio Caffaro
Hierarchical Dense Knowledge Retrieval for Knowledge-enhanced Conversational Agents.
Rel. Giuseppe Rizzo. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2022
|
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (6MB) | Preview |
Abstract: |
In this work will be presented a solution developed to optimize in terms of computational time, the process of knowledge retrieval adopted to extract relevant unstructured documents from a knowledge base given an input query. The experiment relies on the setup delineated in the first track of the ninth Dialog System Technology Challenge (DSTC 9). The goal of this task is to design a frictionless knowledge-enhanced conversational agent able to deal with out-of-scope requests that cannot be addressed simply with the call of an API service but that require access to external knowledge. The knowledge base consists of unstructured textual documents collected from the Frequently Asked Question (FAQ) pages of several entities belonging to five domains. From an analysis of the results of the DSTC9, the knowledge retrieval step resulted as a crucial phase in the knowledge-enhancing process. Indeed, the results in this sub-task showed the highest correlation with human judgment and the model that performed the best in this sub-task resulted also as the winner in the final evaluation. However, the best performing models in the retrieval step are based on the Passage Re-Ranking strategy. This strategy requires a point-wise evaluation of all the knowledge documents, causing the time complexity of the system to scale linearly with the dimension of the knowledge base becoming soon unapplicable to real-case scenarios. The method developed is based on a Hierarchical Dense Knowledge Retrieval system that exploits the hierarchical structure of the documents present in the knowledge base in order to perform computationally efficient knowledge retrieval. |
---|---|
Relatori: | Giuseppe Rizzo |
Anno accademico: | 2021/22 |
Tipo di pubblicazione: | Elettronica |
Numero di pagine: | 86 |
Soggetti: | |
Corso di laurea: | Corso di laurea magistrale in Data Science And Engineering |
Classe di laurea: | Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA |
Aziende collaboratrici: | FONDAZIONE LINKS-LEADING INNOVATION & KNOWLEDGE |
URI: | http://webthesis.biblio.polito.it/id/eprint/23896 |
Modifica (riservato agli operatori) |