polito.it
Politecnico di Torino (logo)

Natural Language Processing with Generative AI Models: A Methodological Approach for Their Application

Stefano Strippoli

Natural Language Processing with Generative AI Models: A Methodological Approach for Their Application.

Rel. Fabrizio Lamberti. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2023

[img]
Preview
PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB) | Preview
Abstract:

In this thesis work, a methodological approach is proposed for application in Natural Language Processing (NLP) tasks exploiting the capabilities of latest Generative Artificial Intelligence (Generative AI) text-to-text models. With this methodology, it will be possible to delineate data features scope effectively, enhance model prompting based on expected input and output, and appropriately select the most suitable Generative model for the required NLP task. The dissertation begins with an analysis of the foundations of the subject, providing an overview of both NLP architectures, from Recurrent Neural Networks to today’s Transformer, and Generative AI state-of-the-art models from GoogleAI, MetaAI and OpenAI. Then the methodology is presented describing its phases, the different options to treat different features of data (such as data sensitivity and input formats) along with the generative model selection process and performance evaluation criteria. Next the methodology is applied to 10 different NLP tasks, from simple comprehension tasks to a difficult and temporally articulated assignment involving a real robotic agent such as Boston Dynamics’ SPOT. The effectiveness of the methodology is discussed, evaluating its performance, underlining strengths, weaknesses and possible ways to solve them. Finally, future developments and solutions, helpful in enhancing the linguistic comprehensions of Generative models, are presented.

Relators: Fabrizio Lamberti
Academic year: 2023/24
Publication type: Electronic
Number of Pages: 73
Subjects:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: New organization > Master science > LM-32 - COMPUTER SYSTEMS ENGINEERING
Aziende collaboratrici: Reply Consulting Srl
URI: http://webthesis.biblio.polito.it/id/eprint/28973
Modify record (reserved for operators) Modify record (reserved for operators)