Rosetta Pagliuca
Stemming and its evaluation in search engines Case of study: Salesforce search engine.
Supervisor: Paolo Garza. Politecnico di Torino, Master's degree course in Computer Engineering (Ingegneria Informatica), 2021
Abstract: | In recent years, as the volume of data and the size of information collections have grown, Information Retrieval methods have been widely studied and improved in order to ensure user satisfaction. Several mechanisms have been developed for obtaining the right information easily and quickly. One useful operation for this type of system is stemming, which reduces a word to its base form. Words in documents and queries have a large number of morphological variants; stemming links together words that share the same base form, increasing the number of documents retrieved by a search engine. This project specifically studies the search engine of the CRM application Salesforce. Its aim is to analyze the impact of stemming on this search engine in terms of resource consumption (query time and memory) and relevance. The analysis is also extended to several stemming algorithms and to the ranking step of the retrieval process, in order to propose an alternative to the existing stemming system. |
---|---|
Supervisors: | Paolo Garza |
Academic year: | 2020/21 |
Publication type: | Electronic |
Number of pages: | 75 |
Additional information: | Restricted thesis. Full text not available |
Subjects: | |
Degree course: | Master's degree course in Computer Engineering (Ingegneria Informatica) |
Degree class: | New regulations > Master's degree > LM-32 - COMPUTER ENGINEERING |
Joint supervision institution: | INSTITUT NATIONAL POLYTECHNIQUE DE GRENOBLE (INPG) - ENSIMAG (FRANCE) |
Collaborating companies: | Salesforce.com |
URI: | http://webthesis.biblio.polito.it/id/eprint/18119 |
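
The abstract describes stemming as reducing words to a base form so that morphological variants match the same documents. The full text is not available, so the following is only a minimal sketch of that general idea: `toy_stem`, `search`, `SUFFIXES`, and `DOCS` are hypothetical names, and the suffix-stripping rule is a toy assumption, not one of the algorithms evaluated in the thesis and not the stemmer used by the Salesforce search engine.

```python
# Toy suffix-stripping stemmer: an illustrative assumption, not Porter's
# algorithm and not the stemmer used by any real search engine.
SUFFIXES = ("ing", "ed", "es", "s")  # checked in this order


def toy_stem(word: str) -> str:
    """Lowercase the word and strip the first matching suffix."""
    word = word.lower()
    for suffix in SUFFIXES:
        # Keep at least three characters so very short words are left alone.
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def search(query: str, docs: list, stem: bool) -> list:
    """Return indices of documents sharing at least one term with the query."""
    normalize = toy_stem if stem else str.lower
    query_terms = {normalize(term) for term in query.split()}
    return [
        index
        for index, doc in enumerate(docs)
        if query_terms & {normalize(term) for term in doc.split()}
    ]


# Hypothetical mini-collection used only for this illustration.
DOCS = [
    "Connecting accounts",
    "He connected the lead",
    "Account connects",
    "Sales report",
]

print(search("connect", DOCS, stem=False))  # exact term matching
print(search("connect", DOCS, stem=True))   # matching on stemmed terms
```

With exact matching the query `connect` retrieves nothing from this collection, while matching on stems retrieves the three documents containing "connecting", "connected", and "connects". This is the increase in retrieved documents the abstract refers to. The toy rule also over-stems some words (for example, "sales" becomes "sal"), which hints at why relevance has to be evaluated alongside recall.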