Edoardo Roba
Leap: a Model-Based Reinforcement Learning Framework for Fast Object Detection.
Rel. Andrea Giuseppe Bottino. Politecnico di Torino, Corso di laurea magistrale in Mechatronic Engineering (Ingegneria Meccatronica), 2020
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (5MB) | Preview |
Abstract
The goal of the project was to create a new algorithm for Object Detection. The starting point of the project was exposed in "Active Object Localization with Deep Reinforcement Learning", where they described an OD algorithm based on Deep Reinforcement Learning and Markov Decision Process. Since every action taken by the Agent, the algorithm compares the current state with the environment, it takes a long time for the computations, because every state is encoded through a CNN, which is a Neural Network with over 58 million parameters (VGG16-like). The purpose of this project was to design a Model-Based algorithm to bypass the CNN.
During the Q-training, many transitions (state-next_state-action) are recorded and are used as training dataset for the predictive NN
Relatori
Tipo di pubblicazione
URI
![]() |
Modifica (riservato agli operatori) |
