Dynamic identification of risk thresholds for balance measures in machine learning

Andrea Adrignola

Dynamic identification of risk thresholds for balance measures in machine learning.

Rel. Antonio Vetro', Mariachiara Mecati. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2022

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (3MB) | Preview

Abstract

Automated decision-making systems (ADM) may significantly affect our everyday life. They can assist us in a number of tasks when used as a reference, or even substitute humans entirely, and they are as much an opportunity to challenge our decision-making processes as they are a mean of reinforcing pre-existing biases. Because even if algorithms are mostly neutral, the data used to train them could (and usually do) encode social biases. For this reason, one of the main approaches to mitigate bias in such a framework is to work on data quality. To assess how the quality of the data affects the outcome of a classification, we made use of two different set of indices, balance measures and fairness measures, relating to different stages of a machine learning pipeline.

Balance measures assess the proportions of classes of a given sensitive attribute (training set), while fairness measures evaluate the fairness of the outcome, in our case a classification (test set), with respect to the same attribute

Relatori

Antonio Vetro', Mariachiara Mecati

Anno Accademico

2021/22

Tipo di pubblicazione

Elettronica

Numero di pagine

Corso di laurea

Corso di laurea magistrale in Data Science And Engineering

Classe di laurea

Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA

URI

https://webthesis.biblio.polito.it/id/eprint/23512

Modifica (riservato agli operatori)