polito.it
Politecnico di Torino (logo)

Natural Language Generation for Automated Sports Broadcasting

Davide Omento

Natural Language Generation for Automated Sports Broadcasting.

Rel. Riccardo Coppola, Anna Arnaudo. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Matematica, 2025

[img] PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.

Download (1MB)
Abstract:

This thesis addresses the design and implementation of an intelligent system built on a Retrieval-Augmented Generation (RAG) framework coupled with a Large Language Model (LLM), engineered to autonomously generate football match commentary from minimal structured input. The architecture fuses advanced neural language generation with dynamic retrieval of player- and team-specific statistics, enabling the production of detailed and contextually relevant narratives. The approach encompasses the development of an event annotation framework and a user-friendly interface that enables users to select match events and provide relevant contextual details. A thoughtfully engineered prompting strategy, complemented by few-shot examples, directs the LLM to generate coherent and contextually precise commentary while maintaining factual integrity, including information such as goal scorer, assist provider, type of shot, and event timing. Evaluation covers both quantitative metrics-such as accuracy and coverage of events-and qualitative measures, including human judgments of clarity, informativeness, and narrative quality. The results indicate that the RAG-based LLM system achieves an event-level accuracy of 99% and that, based on human evaluations, the generated commentaries were rated at a level comparable to human-written ones. This architecture therefore provides a flexible and scalable solution for automated sports commentary, with potential applications in live broadcasting, online platforms, and as a supportive tool for human commentators.

Relatori: Riccardo Coppola, Anna Arnaudo
Anno accademico: 2025/26
Tipo di pubblicazione: Elettronica
Numero di pagine: 82
Soggetti:
Corso di laurea: Corso di laurea magistrale in Ingegneria Matematica
Classe di laurea: Nuovo ordinamento > Laurea magistrale > LM-44 - MODELLISTICA MATEMATICO-FISICA PER L'INGEGNERIA
Aziende collaboratrici: NON SPECIFICATO
URI: http://webthesis.biblio.polito.it/id/eprint/37151
Modifica (riservato agli operatori) Modifica (riservato agli operatori)