Michele Russo
From IDE to Edge: A Cloud-Native Multi-Agent Framework for Automated Edge AI Deployment on STM32.
Rel. Gianvito Urgese, Giuseppe Fanuli, Andrea Pignata. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2026
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (8MB) | Preview |
Abstract
The deployment of Artificial Intelligence (AI) on resource-constrained embedded systems, commonly known as Edge AI, is often hindered by stringent memory, computational power, and energy constraints. While the STM32 ecosystem offers specialized tools such as STM32CubeMX and X-CUBE-AI, orchestrating the full Machine Learning Operations (MLOps) lifecycle, from firmware configuration to model optimization and deployment, remains a complex and primarily manual process. This thesis introduces a scalable Agentic MLOps Orchestration Framework designed to automate and optimize the Edge AI development workflow on STM32 microcontrollers. To move beyond the limitations of local execution, the proposed architecture adopts a cloud-native approach that maximizes resource utilization by decoupling the multi-agent orchestration logic (CPU-bound) from the Large Language Model (LLM) inference engine (GPU-bound).
A central contribution of this work is the adoption of a scalable inference architecture that transitions from monolithic, resource-monopolizing environments to a distributed model where high-performance LLMs are served as shared network resources
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
Aziende collaboratrici
URI
![]() |
Modifica (riservato agli operatori) |
