![]() | Livello precedente |
Francesco Giacometti.
Adaptivity of Markovian and History-Based Reinforcement Learning Policies in Environments with Latent Dynamic Parameters.
Rel. Giuseppe Bruno Averta, Gabriele Tiboni. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2025