Enrico Alfonso Girardi
Automatic slide generation from scientific papers based on multimodal learning.
Supervisors: Luca Cagliero, Moreno La Quatra. Politecnico di Torino, Master's degree programme in Data Science And Engineering, 2022
PDF (Tesi_di_laurea) - Thesis (4MB)
License: Creative Commons Attribution Non-commercial No Derivatives.
Abstract: |
Text summarization for scientific papers is a very active research topic with many state-of-the-art solutions. Its application in a multimodal setting, i.e., summarizing the paper content in slides while also considering image content, is a recent field of research that only a few papers tackle. The purpose of this work is to produce presentation slides for scientific papers by using a state-of-the-art solution in the field of extractive text summarization (text only) combined with a multimodal model capable of learning visual concepts from images. Furthermore, this report aims to be a useful read for anyone who wants to approach this task and to support the production of future works. |
---|---|
Supervisors: | Luca Cagliero, Moreno La Quatra |
Academic year: | 2022/23 |
Publication type: | Electronic |
Number of Pages: | 57 |
Subjects: | |
Degree programme: | Master's degree programme in Data Science And Engineering |
Degree class: | New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING |
Collaborating companies: | UNSPECIFIED |
URI: | http://webthesis.biblio.polito.it/id/eprint/24617 |