Abdessamed Qchohi
On the Role of Uncertainty in In-Context Grokking: a Bayesian Approach.
Rel. Paolo Garza, Simone Rossi. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2026
|
|
PDF (Tesi_di_laurea)
- Tesi
Accesso limitato a: Solo utenti staff fino al 27 Marzo 2027 (data di embargo). Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (11MB) |
Abstract
Understanding how neural networks shift from memorization to generalization is crucial to modern AI, particularly in the context of emergent behaviors such as In-Context Learning (ICL) and grokking. While prior work has studied these phenomena mainly through accuracy and structural analysis, the role of uncertainty in shaping this transition has been largely overlooked. In this work, we present a probabilistic framework to investigate ICL and grokking from a Bayesian perspective, focusing on how uncertainty evolves during training and influences generalization dynamics. First, we develop a reproducible experimental pipeline based on modular arithmetic tasks, enabling controlled evaluation of in-distribution and out-of-distribution generalization in Transformer models.
Then, we integrate approximate Bayesian inference methods —including Monte-Carlo Dropout, Laplace Approximation, and Improved Variational Online Newton (IVON)— to quantify predictive, epistemic, and aleatoric uncertainty throughout training
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
Ente in cotutela
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
