Post-modelling Explainability for Deep Reinforcement Learning

Antonin Louis Leon Poche

Post-modelling Explainability for Deep Reinforcement Learning.

Rel. Giovanni Squillero. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2021

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (5MB) | Preview

Abstract:	Artificial Intelligence (AI) has developed tremendously in recent years, notably thanks to the advances in neural networks. However, the "black box" character of the latter has slowed down the diffusion of Deep Learning (DL) in the industry. Indeed, despite the growing efficiency of neural networks, they still do not have the confidence of industrials. This is why explainability is a rapidly expanding research sector. Delfox is working on Deep Reinforcement Learning (DRL) for important industrial actors working in particular with critical systems. Explainability applied to Reinforcement Learning (RL) is therefore a key issue for Delfox and thus it is the focus of this internship. Explainability is still a young field of research and there is no industrial application of such a technology known to date. Hence the challenge of Delfox is to make that happen, they also need to show that their AIs are reliable. This report presents an exhaustive bibliography, a taxonomy of the eXplainable Artificial Intelligence (XAI) methods applicable to RL and the methods from XRL. From this bibliography, three methods merged, they have been studied and applied on a project. This report presents the methods called Feature Relevance (FR), Observation Clustering (OC) and Probe Sensing (PS). They allow to generate complementary explanations of the decisions and behavior of an Artificial Intelligence (AI) of RL.
Relatori:	Giovanni Squillero
Anno accademico:	2020/21
Tipo di pubblicazione:	Elettronica
Numero di pagine:	62
Soggetti:
Corso di laurea:	Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea:	Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici:	DELFOX PREDICTIVE TECHNOLOGIES
URI:	http://webthesis.biblio.polito.it/id/eprint/19264

Modifica (riservato agli operatori)