polito.it
Politecnico di Torino (logo)

Portfolio Management: a Deep Reinforcement Learning Approach

Yi Yu Ivan Chen

Portfolio Management: a Deep Reinforcement Learning Approach.

Rel. Paolo Brandimarte, Edoardo Fadda, Carlo Sgarra. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Matematica, 2022

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (5MB) | Preview
Abstract:

Portfolio management is the task of constantly redistributing the wealth of an investor into different financial products known as assets in order to maximize the overall profit during the trading period while maintaining an acceptable level of risk. Traditionally, this problem has been heavily studied using a static approach, leading to the creation of what is known as the Modern Portfolio Theory by economist H. Markowitz in 1952. However, more recent developments in the field of optimization of dynamical systems, especially via reinforcement learning, have sparked a growing interest in applying these new techniques to the portfolio optimization task. In this thesis, we represent the trading environment as a discrete-time stochastic dynamical system and apply deep learning based reinforcement learning algorithms to train an agent to learn a trading strategy that optimizes a given objective. In particular, we assume that the agent has information only on the daily open, high, low and close prices and the trading volumes and, using these values, the trading agent has to decide, at the end of every trading day, what portfolio to build for the next day, keeping into account that all transactions are penalized due to the presence of a transaction cost equal to a small percentage of the total transaction value. Regarding the reinforcement learning algorithms employed, we focused only on actor-critic methods using LSTM-based architectures to represent the value and policy functions. These techniques are further improved via a pre-training step by initializing the networks weights using the optimal weights obtained by training a network with the same structure to forecast portfolio returns. Multiple experiments done using real market data consisting of a small set of stocks chosen from those included in the S&P500 index show the validity of applying the reinforcement learning framework to the task of portfolio optimization.

Relatori: Paolo Brandimarte, Edoardo Fadda, Carlo Sgarra
Anno accademico: 2022/23
Tipo di pubblicazione: Elettronica
Numero di pagine: 86
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Matematica
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-44 - MODELLISTICA MATEMATICO-FISICA PER L'INGEGNERIA
Aziende collaboratrici: ORS GROUP GMBH
URI: http://webthesis.biblio.polito.it/id/eprint/24871
Modifica (riservato agli operatori) Modifica (riservato agli operatori)