Politecnico di Torino (logo)

Semantic Annotation of Clinical Notes

Jeanpierre Francois

Semantic Annotation of Clinical Notes.

Rel. Maurizio Morisio, Giuseppe Rizzo. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2020

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB) | Preview

This master thesis work outlines the approach chosen to manage the extraction of information from unstructured and non standardized Italian electronic health records of breast cancer summaries. Our problem translates into a Named Entity Recognition activity that faces completely original classes that do not overlap at all with those normally studied in the literature. Taking into account feasibility and speed, we propose a solution that applies natural language processing procedures to fine-tune a statistical model trained onto a narrow set of annotated data. In order to obtain a gold standard we collaborated with domain experts to manually annotate the data collection. We defined with clinicians an annotation protocol and, to reduce the necessary human effort and speed up the process, customized a user friendly web-based annotation tool.

Relators: Maurizio Morisio, Giuseppe Rizzo
Academic year: 2019/20
Publication type: Electronic
Number of Pages: 82
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Aziende collaboratrici: FONDAZIONE LINKS
URI: http://webthesis.biblio.polito.it/id/eprint/16240
Modify record (reserved for operators) Modify record (reserved for operators)