polito.it
Politecnico di Torino (logo)

Natural Language Models for Querying Database

Mohamed Khaled Hassan Aly Motrash

Natural Language Models for Querying Database.

Rel. Alessandro Aliberti, Edoardo Patti, Lorenzo Bottaccioli, Marco Castangia. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2024

[img] PDF (Tesi_di_laurea) - Tesi
Accesso riservato a: Solo utenti staff fino al 13 Giugno 2026 (data di embargo).
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (2MB)
Abstract:

This thesis explored the application of Large Language Models (LLMs) to automate SQL query generation, aiming to bridge the gap between natural language and database interaction. By Exploring different LLMs and analyzing their architecture and their ability to understand different large language and manage to produce a corresponding SQL query that fulfils the task. In specific several LLM were explored to achieve our goal of finding the most suitable LLM for the task and among these models were the Deepseek7B and Mistral7B which have shown more advanced results compared to others. Specifically by separating the problem into 2 tasks we have managed to increase the overall performance such that for the first task it will be related to selecting the most suitable schema for the tables for the asked question and the second task will be for using that of the first task along with the question to get the final output. Along the study we have managed to get very close to the state of the art reported by closed models and much bigger sized models to the extent that we could actually depend on it for commercial use.

Relatori: Alessandro Aliberti, Edoardo Patti, Lorenzo Bottaccioli, Marco Castangia
Anno accademico: 2024/25
Tipo di pubblicazione: Elettronica
Numero di pagine: 67
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering)
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/33835
Modifica (riservato agli operatori) Modifica (riservato agli operatori)