Natural Language Generation for Automated Sports Broadcasting
Davide Omento
Natural Language Generation for Automated Sports Broadcasting.
Supervisors: Riccardo Coppola, Anna Arnaudo. Politecnico di Torino, Master's degree in Mathematical Engineering (Corso di laurea magistrale in Ingegneria Matematica), 2025
License: Creative Commons Attribution Non-commercial No Derivatives.
Abstract
This thesis addresses the design and implementation of an intelligent system built on a Retrieval-Augmented Generation (RAG) framework coupled with a Large Language Model (LLM), engineered to autonomously generate football match commentary from minimal structured input. The architecture fuses advanced neural language generation with dynamic retrieval of player- and team-specific statistics, enabling the production of detailed and contextually relevant narratives. The approach encompasses the development of an event annotation framework and a user-friendly interface that enables users to select match events and provide relevant contextual details. A thoughtfully engineered prompting strategy, complemented by few-shot examples, directs the LLM to generate coherent and contextually precise commentary while maintaining factual integrity, including information such as goal scorer, assist provider, type of shot, and event timing.
Evaluation covers both quantitative metrics, such as accuracy and coverage of events, and qualitative measures, including human judgments of clarity, informativeness, and narrative quality.
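
As a rough illustration of the pipeline summarized in this abstract, the Python sketch below assembles a few-shot prompt from a structured goal event plus retrieved statistics, and applies a crude coverage check to a generated sentence. It is not taken from the thesis: the `GoalEvent` schema, the `STATS` lookup table, the placeholder names ("Player A", "Team X"), the prompt wording, and the `covers_facts` check are all assumptions made for illustration, and the LLM call itself is left out.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class GoalEvent:
    """Minimal structured input for one goal (hypothetical schema)."""
    minute: int
    team: str
    scorer: str
    shot_type: str
    assist: Optional[str] = None


# Placeholder statistics; a real system would query a retrieval index instead.
STATS = {
    "Player A": "Player A has 7 league goals this season.",
    "Player B": "Player B has 5 assists this season.",
}

# Placeholder few-shot pair: (structured event description, reference commentary).
FEW_SHOT = [
    ("minute=12; team=Team X; scorer=Player C; assist=none; shot=penalty",
     "Minute 12: Player C converts the penalty to put Team X ahead."),
]


def describe(event: GoalEvent) -> str:
    """Serialize the event fields the commentary must preserve."""
    return (f"minute={event.minute}; team={event.team}; scorer={event.scorer}; "
            f"assist={event.assist or 'none'}; shot={event.shot_type}")


def retrieve(event: GoalEvent, stats: dict) -> list:
    """Look up statistics for the players named in the event."""
    names = [event.scorer] + ([event.assist] if event.assist else [])
    return [stats[name] for name in names if name in stats]


def build_prompt(event: GoalEvent, stats: dict, examples: list) -> str:
    """Assemble instructions, few-shot examples, retrieved context, and the event."""
    lines = ["Write one sentence of football commentary. Use only the facts given."]
    for source, commentary in examples:
        lines += [f"Event: {source}", f"Commentary: {commentary}"]
    lines += [f"Context: {fact}" for fact in retrieve(event, stats)]
    lines += [f"Event: {describe(event)}", "Commentary:"]
    return "\n".join(lines)


def covers_facts(event: GoalEvent, commentary: str) -> bool:
    """Crude coverage check: scorer, assist (if any), and minute all appear."""
    required = [event.scorer, str(event.minute)]
    if event.assist:
        required.append(event.assist)
    return all(item in commentary for item in required)
```

The string returned by `build_prompt` would be passed to whichever LLM is in use. `covers_facts` is only a substring test and says nothing about fluency or narrative quality, which the abstract assigns to human judgment.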
