Mahsa Farjoo
Automated forms cleaning by considering forms with coloured content and coloured backgrounds.
Supervisor: Alessandro Savino. Politecnico di Torino, Master's degree course in Ingegneria Informatica (Computer Engineering), 2023
License: Creative Commons Attribution Non-commercial No Derivatives.
Abstract
The main purpose of this project is the extraction or erasure of the information filled into forms. Forms contain both static and dynamic content. When customers or clients complete a form, the static content stays the same, while the dynamic content differs for each customer or client. Since real data of this kind is scarce, I used simulation to generate a synthetic dataset. The empty forms are in PDF format, so I first convert them from PDF to PNG, then generate the dynamic content and insert it at different locations on the form. To generate the dynamic content, I first have to determine which dynamic data each form requires, and then use a Python fake-data library to generate random values.
For instance, to fill an empty form, the Python library needs to generate German full names, addresses, bank account information, dates of birth, job titles, email addresses and some random text.
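The generation and placement steps described above can be illustrated with a minimal sketch. It is not the thesis implementation: it uses only the Python standard library as a stand-in for the fake-data library, and the sample value pools, the `Field` class, the function names `fake_record` and `place`, and the pixel coordinates are all hypothetical names and values chosen for illustration.

```python
import random
from dataclasses import dataclass
from datetime import date, timedelta
from typing import Dict, List, Tuple

# Hypothetical sample pools (assumption); a real fake-data library
# would supply far richer, locale-aware values.
FIRST_NAMES = ["Anna", "Lukas", "Marie", "Jonas", "Sophie"]
LAST_NAMES = ["Schmidt", "Fischer", "Weber", "Wagner", "Becker"]
STREETS = ["Hauptstraße", "Bahnhofstraße", "Gartenweg", "Schulstraße"]
CITIES = ["Berlin", "Hamburg", "München", "Köln"]
JOBS = ["Ingenieurin", "Lehrer", "Kaufmann", "Ärztin"]


@dataclass
class Field:
    """A dynamic field on the form image (hypothetical helper type)."""
    name: str
    box: Tuple[int, int, int, int]  # left, top, right, bottom in pixels


def fake_record(rng: random.Random) -> Dict[str, str]:
    """Generate one synthetic, German-style set of form values."""
    first = rng.choice(FIRST_NAMES)
    last = rng.choice(LAST_NAMES)
    birth = date(1950, 1, 1) + timedelta(days=rng.randrange(20000))
    # IBAN-shaped string only (DE + 20 digits); the check digits are not valid.
    iban = "DE" + "".join(rng.choice("0123456789") for _ in range(20))
    address = "{} {}, {} {}".format(
        rng.choice(STREETS), rng.randint(1, 200),
        rng.randint(10000, 99999), rng.choice(CITIES),
    )
    return {
        "full_name": f"{first} {last}",
        "address": address,
        "iban": iban,
        "date_of_birth": birth.strftime("%d.%m.%Y"),
        "job_title": rng.choice(JOBS),
        "email": f"{first}.{last}@example.com".lower(),
    }


def place(record: Dict[str, str], fields: List[Field], rng: random.Random,
          max_offset: int = 8) -> List[Tuple[str, str, Tuple[int, int]]]:
    """Pick a slightly randomised text anchor inside each field's box."""
    placements = []
    for field in fields:
        left, top, right, bottom = field.box
        x = rng.randint(left, min(left + max_offset, right))
        y = rng.randint(top, min(top + max_offset, bottom))
        placements.append((field.name, record[field.name], (x, y)))
    return placements
```

Each returned tuple holds a field name, the generated text and an `(x, y)` anchor. In a full pipeline these would be drawn onto the PNG rendering of the empty form with an imaging library, and repeating the process with different seeds would yield many differently filled copies of the same form.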
