Deep Reinforcement Learning for Dynamic Job-Shop Scheduling in High-Utilization Systems

Valerio Firmano

Deep Reinforcement Learning for Dynamic Job-Shop Scheduling in High-Utilization Systems.

Rel. Paolo Brandimarte, Edoardo Fadda. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Matematica, 2024

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (1MB) | Preview

Abstract:	This thesis presents the development and evaluation of a Double Deep Q-learning Network (DDQN) agent for addressing the Dynamic Job-Shop Scheduling Problem (DJSP) in high-utilization systems. The DDQN agent is designed to optimize job scheduling dynamically, taking into account stochastic arrivals and stochastic jobs attributes. By employing separate networks for action selection and evaluation, the DDQN mitigates overestimation bias, enhancing the stability and accuracy of the learning process. Preliminary results indicate that the DDQN agent performs well in highly utilized systems, demonstrating significant promise in optimizing scheduling efficiency, though its performance is comparable to some well-tailored traditional heuristics. However, its performance in less utilized systems remains less effective, suggesting room for further refinement. The findings highlight the potential of reinforcement learning techniques in complex, dynamic industrial environments and open the way for future advancements in adaptive scheduling solutions.
Relatori:	Paolo Brandimarte, Edoardo Fadda
Anno accademico:	2023/24
Tipo di pubblicazione:	Elettronica
Numero di pagine:	78
Soggetti:
Corso di laurea:	Corso di laurea magistrale in Ingegneria Matematica
Classe di laurea:	Nuovo ordinamento > Laurea magistrale > LM-44 - MODELLISTICA MATEMATICO-FISICA PER L'INGEGNERIA
Aziende collaboratrici:	NON SPECIFICATO
URI:	http://webthesis.biblio.polito.it/id/eprint/31608

Modifica (riservato agli operatori)