polito.it
Politecnico di Torino (logo)

Automated Quality Assurance with Multi-modal Large Language Models

Gabriele Sanmartino

Automated Quality Assurance with Multi-modal Large Language Models.

Rel. Luca Cagliero, Paolo Papotti. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2025

Abstract:

Quality Assurance (QA) is a critical aspect for a company whose product is an application directly used by customers. Traditional QA automation often struggles with dynamic interfaces and evolving scenarios, and it cannot effectively handle cases where identifying issues requires reasoning or contextual understanding. This thesis explores the integration of Large Language Models (LLMs) in a multi-modal setup to enhance QA automation, leveraging their contextual understanding of images and ability to extract meaningful information. The research focuses on three key phases: the experimentation phase to assess the current capabilities of state-of-the-art models in image analysis, the implementation of tests using real cases within the company's application, and the exploration of AI Agents to autonomously navigate the application and extract relevant information. The experiments are initially conducted on open-source datasets. Later, custom datasets are created using samples retrieved from the company's application to test the models' abilities in the specific use case of identifying bugs within the internal app. Given the promising results, specific scenarios within the application are selected for the implementation of tests powered by LLMs, with the aim of ensuring the ongoing consistency of both data and layout within the application for cases that cannot be handled by traditional QA. Experiments demonstrate that LLMs can significantly improve QA automation by reducing manual intervention, and the production implementation highlights the feasibility of integrating LLM-driven tests into existing workflows. The AI Agents are directly tested on the implemented tests within the application to assess the feasibility of replacing time-consuming and sometimes complex step implementations with an autonomous agent navigating the app. The Agents demonstrate both surprising abilities and limitations, but these emerging tools show great potential for the future. This work contributes to the AI4AI field, a key topic in the AI world and a central theme in the company's leading innovations, such as Aily. In particular, the thesis demonstrates the potential of LLMs to revolutionize QA testing by enabling more efficient workflows, handling more complex tasks and producing more accurate outcomes.

Relatori: Luca Cagliero, Paolo Papotti
Anno accademico: 2024/25
Tipo di pubblicazione: Elettronica
Numero di pagine: 97
Informazioni aggiuntive: Tesi secretata. Fulltext non presente
Soggetti:
Corso di laurea: Corso di laurea magistrale in Data Science And Engineering
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-32 - INGEGNERIA INFORMATICA
Ente in cotutela: INSTITUT EURECOM (FRANCIA)
Aziende collaboratrici: Aily Labs Iberia, S.L.U.
URI: http://webthesis.biblio.polito.it/id/eprint/35432
Modifica (riservato agli operatori) Modifica (riservato agli operatori)