
Using Explainability Methods to Uncover Shortcuts in Language Models

Fabio Marmello

Using Explainability Methods to Uncover Shortcuts in Language Models.

Supervisors: Luca Vassio, Marco Mellia, Idilio Drago. Politecnico di Torino, Master's degree programme in Cybersecurity, 2025