Personal Data Detection in Free Text
Gabriele Gioetto
Personal Data Detection in Free Text.
Rel. Giuseppe Rizzo. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2023
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (4MB) | Preview |
Abstract
An HR (Human Resources) department in a large organization receives inquiries/requests from employees on multiple topics, which are quite different from one another. As an example, an employee can send requests dealing with health conditions, compensation/taxation, events of life (marriage, death of a relative. . . ). These data can be used for many different queries that can be useful for analysis purposes (Example: ‘How many people have had COVID during 2021‘). However, HR tickets typically contain personal data, that cannot be processed without the consent of the data subject according to the European privacy regulation (GDPR). To be able to process documents with personal data, we can identify the pieces of information that qualify as personal data in a communication and subsequently anonymize such information using the appropriate techniques.
A significant part of this problem is represented by the complex nature of personal data according to GDPR: personal data are defined as ‘any piece of information that can be connected to an identified or identifiable natural person‘
Tipo di pubblicazione
URI
![]() |
Modifica (riservato agli operatori) |
