Giuseppe Antonio Gentile
A Modular LLM‑Based Framework for Semantic Navigation and Perception in Mobile Robotics.
Rel. Alessandro Rizzo, Pangcheng David Cen Cheng. Politecnico di Torino, Corso di laurea magistrale in Mechatronic Engineering (Ingegneria Meccatronica), 2025
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (8MB) | Preview |
Abstract
As robots become increasingly integrated into human environments, the ability to interact intuitively through natural language has become a crucial goal. Traditional robotic control systems require structured inputs and predefined behaviors, limiting their adaptability in dynamic, real world environments. Recent advances in Large Language Models (LLM) offer a new paradigm: harnessing language as a general interface for reasoning, perception, and decision making. However, integrating LLMs with embodied agents presents fundamental challenges, including grounding instructions in physical space, ensuring safety, and linking symbolic language and low-level robotic actions. This thesis explores the integration of Large Language Models (LLMs) into robotic systems for natural language-driven control and perception.
The proposed architecture connects a GPT-based reasoning agent with a ROS~2 navigation stack and a visual pipeline comprising BLIP for visual question answering and YOLOv8 with depth sensing for object localization
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
URI
![]() |
Modifica (riservato agli operatori) |
