Politecnico di Torino (logo)

Towards safe and efficient transfer of robot policies from simulation to real world

Gabriele Tiboni

Towards safe and efficient transfer of robot policies from simulation to real world.

Rel. Barbara Caputo, Ville Kyrki. Politecnico di Torino, Corso di laurea magistrale in Data Science and Engineering, 2021

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (46MB) | Preview

Reinforcement learning approaches have demonstrated great promise for flexible and efficient robot learning. Yet, current data-driven algorithms require large amounts of data to learn even simple robotic tasks and leave open safety concerns when training on real hardware. Physics simulators offer several advantages in this regard, allowing to train robot policies entirely in simulation before the final deployment onto the real world. However, while this solves the problem of fast and safe data collection, any small inaccuracy or parameter discrepancy in simulation may potentially lead to policies which do not directly transfer to the real system. Domain randomization has recently gained a lot of traction as a method to overcome the reality gap experienced by transferred policies, encouraging robustness to domain shifts by randomizing physical parameters in the simulated scene. As current applications of this method require tedious manual engineering to find optimal randomization ranges, we introduce a novel algorithm to automatically identify the parameter distributions to train on, based on limited real-world data safely collected through human demonstrations. We show that the optimized distributions are capable of compensating for unmodelled phenomena in simulation. Furthermore, we evaluate our method on two real robots demonstrating a successful domain transfer and improved performance over prior methods.

Relators: Barbara Caputo, Ville Kyrki
Academic year: 2021/22
Publication type: Electronic
Number of Pages: 99
Corso di laurea: Corso di laurea magistrale in Data Science and Engineering
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Ente in cotutela: AALTO UNIVERSITY OF TECHNOLOGY - School of Science (FINLANDIA)
Aziende collaboratrici: Aalto University
URI: http://webthesis.biblio.polito.it/id/eprint/20542
Modify record (reserved for operators) Modify record (reserved for operators)