Logo Politecnico di Torino
ITEN
WebThesis

Post Training Low Rank Approximation for KV Cache Compression in Large Language Models

Andrea Vannozzi

Post Training Low Rank Approximation for KV Cache Compression in Large Language Models.

Rel. Daniele Jahier Pagliari, Alessio Burrello, Luca Benfenati. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2026