Login
ENIT
WebThesis Logo Politecnico di Torino

Using Explainability Methods to Uncover Shortcuts in Language Models

Using Explainability Methods to Uncover Shortcuts in Language Models

Fabio Marmello

Using Explainability Methods to Uncover Shortcuts in Language Models.

Rel. Luca Vassio, Marco Mellia, Idilio Drago. Politecnico di Torino, Master of science program in Cybersecurity, 2025