Simona Fiorito
Variable importance in modern regression with application to supplier productivity analysis.
Rel. Mauro Gasparini. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Matematica, 2018
|
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (1MB) | Preview |
Abstract: |
In the last decades the amount of available data increased extremely and the data analytics field evolved and expanded accordingly. Starting from linear regression many other algorithms have been developed, leading to the spread of machine learning methods. Such methods are able to implement more complex regression models, achieving higher accuracy, but often at the cost of interpretability. In this master's thesis a study about the interpretation of some regression models has been carried out. Methods to evaluate variable importance and to display efficiently feature effects on the variable of interest are explained; the respectively R functions that implement such methods are illustrated through the presentation of a case study developed by the author during an internship in Tetra Pak. |
---|---|
Relators: | Mauro Gasparini |
Academic year: | 2018/19 |
Publication type: | Electronic |
Number of Pages: | 79 |
Subjects: | |
Corso di laurea: | Corso di laurea magistrale in Ingegneria Matematica |
Classe di laurea: | New organization > Master science > LM-44 - MATHEMATICAL MODELLING FOR ENGINEERING |
Aziende collaboratrici: | Tetra Pak Packagin Solutions S.p.A. |
URI: | http://webthesis.biblio.polito.it/id/eprint/9930 |
Modify record (reserved for operators) |