polito.it
Politecnico di Torino (logo)

AI Copycats: Imitation Learning for Driving Style Modeling with Unity ML-Agents

Matteo Colucci

AI Copycats: Imitation Learning for Driving Style Modeling with Unity ML-Agents.

Rel. Francesco Strada, Andrea Bottino. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2024

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (14MB) | Preview
Abstract:

Latency mitigation (or compensation) is one of the main concerns when developing online applications that rely on real-time interaction between users, as latency constraints for highly dynamic applications (e.g., competitive First-Person Shooters or racing games) are very strict. In a server-authoritative setting — i.e., a client-server configuration in which the server has the final say on any performed action — Artificial Intelligence (AI) enables a new solution for reducing the round-trip time of packets to and from users experiencing high latency. This thesis' contribution, in collaboration with the MPAI (Moving Picture, audio and data coding by Artificial Intelligence) Community, is an application of an Imitation Learning approach to a custom-made kart racing videogame, with the purpose of showing a possible implementation of the SPG (Server-based Predictive Multiplayer Gaming) specification for the steps that concern a single user (namely data gathering, model training and evaluation). Imitation Learning is widely used in conjunction with Reinforcement Learning to train robotic agents. In contrast with a fully Reinforcement-Learning-based approach, demonstrations from a human "expert" are provided to the agent to take example from, which kick-start the following autonomous learning phase typical of Reinforcement Learning. For behavior modeling, however, Imitation Learning is used exclusively, training on users' in-game performances, with no following Reinforcement Learning. The resulting models, according to the MPAI-SPG specification, are then employed by the server in order to temporarily take control of an user's vehicle if they were to incur in high latency spikes. After a server intervention episode, a faithful model would allow for minimal reconciliation, and an ideally seamless experience for all other players. In addition to the aforementioned karting videogame, an in-editor framework was developed to aid in automating training and testing, and to ease interaction with the data-hosting server.

Relatori: Francesco Strada, Andrea Bottino
Anno accademico: 2023/24
Tipo di pubblicazione: Elettronica
Numero di pagine: 55
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/31925
Modifica (riservato agli operatori) Modifica (riservato agli operatori)