polito.it
Politecnico di Torino (logo)

Tailoring the Knowledge Data Discovery process to e-commerce reviews. = Tailoring the Knowledge Data Discovery process to e-commerce reviews.

Francesca Gubbiotti

Tailoring the Knowledge Data Discovery process to e-commerce reviews. = Tailoring the Knowledge Data Discovery process to e-commerce reviews.

Rel. Tania Cerquitelli, Evelina Di Corso. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Gestionale, 2019

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (6MB) | Preview
Abstract:

This thesis mainly concerns data driven technologies and the related potentiality to bring out, from textual data, previously unknown knowledge. The goal, for the current work case, is to extract potentially useful information from Amazon products reviews. To this aim, Knowledge Data Discovery (KDD) process, applied on textual data collection, has been tailored to Amazon reviews dataset. First of all a broad description of text mining with its benefits and pitfalls has been introduced along with existing algorithm and methodologies; then, ESCAPE engine has been studied, tailored and proposed as a not-time consuming and low-computational cost solution. The proposed tool approaches and integrates all the building blocks of KDD processes such as data processing and characterization, self-Tuning exploratory data analytics, and knowledge validation and visualization. Two different approaches with self-tuning algorithms for the exploratory phase are also included. Before running the experiments, Amazon Web Service platform (AWS) case and the importance of data analysis for business choice have been discussed. A large number of experiments, applying the two approaches, have been performed on almost twenty products reviews dataset, some of them specifically built during the development of the thesis according to the work needs. Experimental results have been finally analysed, validated and visualized with several techniques in order to show, from a technical point of view, the performances of both the two ESCAPE approaches and the strategies used within them, and, from a business analysis point of view, interesting features among the different products categories comments.

Relatori: Tania Cerquitelli, Evelina Di Corso
Anno accademico: 2019/20
Tipo di pubblicazione: Elettronica
Numero di pagine: 100
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Gestionale
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-31 - INGEGNERIA GESTIONALE
Ente in cotutela: RUG - Universiteit Gent (BELGIO)
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/13481
Modifica (riservato agli operatori) Modifica (riservato agli operatori)