polito.it
Politecnico di Torino (logo)

Causality in Machine Learning and the Large Dimensional Limit

Theo Marchetta

Causality in Machine Learning and the Large Dimensional Limit.

Rel. Alfredo Braunstein, Cyril Furtlehner, Sergio Chibbaro. Politecnico di Torino, Corso di laurea magistrale in Physics Of Complex Systems (Fisica Dei Sistemi Complessi), 2024

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB) | Preview
Abstract:

In this thesis, we investigate the incorporation of causality into machine learning models, particularly focusing on large dimensional data. The work starts with an exploration of linear regression within machine learning, highlighting its connections with linear response theory, a field of statistical physics. We derive a closed-form solution for the empirical response function, accounting for bias and variance. The study extends to high-dimensional data, addressing the complexities of identifying causal relationships and the curse of dimensionality. Numerical simulations show that observational causality can be inferred in high-dimensional contexts with linear dependencies among variables. This thesis bridges principles from statistical mechanics and machine learning, offering new perspectives on causal relationships in complex systems.

Relators: Alfredo Braunstein, Cyril Furtlehner, Sergio Chibbaro
Academic year: 2023/24
Publication type: Electronic
Number of Pages: 22
Subjects:
Corso di laurea: Corso di laurea magistrale in Physics Of Complex Systems (Fisica Dei Sistemi Complessi)
Classe di laurea: New organization > Master science > LM-44 - MATHEMATICAL MODELLING FOR ENGINEERING
Aziende collaboratrici: INRIA
URI: http://webthesis.biblio.polito.it/id/eprint/31719
Modify record (reserved for operators) Modify record (reserved for operators)