polito.it
Politecnico di Torino (logo)

Stemming and its evaluation in search engines Case of study: Salesforce search engine

Rosetta Pagliuca

Stemming and its evaluation in search engines Case of study: Salesforce search engine.

Rel. Paolo Garza. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2021

Abstract:

In the last years, due to the increase of the amount of data and size of information collections, Information Retrieval methods have been enhanced and widely studied, in order to assure users satisfaction. Several mechanism have been developed for obtaining easily and quickly the right information. One useful operation for this type of systems is called stemming. It consists in the reduction of the word to its base form. In fact, words present in documents and queries have a large number of morphological variants. Stemming application is the way for linking together words sharing the same base form, increasing the number of documents retrieved by search engines. This project is developed studying specifically search engine of CRM application Salesforce. Its aim is to analyze the impact of stemming on this search engine, in terms of resource consumption - query time and memory - and relevance. Moreover, this analysis has been extended on several stemming algorithms and on the ranking step of retrieval process in order to propose, at the end, an alternative to the existing stemming system.

Relatori: Paolo Garza
Anno accademico: 2020/21
Tipo di pubblicazione: Elettronica
Numero di pagine: 75
Informazioni aggiuntive: Tesi secretata. Fulltext non presente
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Ente in cotutela: INSTITUT NATIONAL POLYTECHNIQUE DE GRENOBLE (INPG) - ENSIMAG (FRANCIA)
Aziende collaboratrici: Salesforce.com
URI: http://webthesis.biblio.polito.it/id/eprint/18119
Modifica (riservato agli operatori) Modifica (riservato agli operatori)