Andrea Borgno
Vision Transformers for Surgical Scene Understanding and Skill Assessment in Minimally Invasive Robotic Suturing.
Rel. Kristen Mariko Meiburger, Francesco Marzola, Alberto Arezzo. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Biomedica, 2025
|
|
PDF (Tesi_di_laurea)
- Tesi
Accesso limitato a: Solo utenti staff fino al 23 Luglio 2026 (data di embargo). Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (37MB) |
Abstract
Minimally invasive robotic surgery has marked a revolution in clinical practice, and a significant aspect of its future evolution involves analyzing endoscopic visual data to develop intelligent assistance functionalities. One of the main challenges is understanding the surgical workflow, which involves recognizing activities at multiple levels of granularity, from macro-level procedural phases to micro-level atomic actions. This work addresses this challenge by focusing on a paradigmatic and complex task for robotic surgery: suturing. The first contribution is the creation and annotation of the LUMICS (Multi-Level Understanding of Minimally Invasive Colon Suturing) dataset, consisting of 15 videos of suturing procedures on porcine colon, performed with the da Vinci Research Kit (dVRK) system at the MITIC laboratory, Department of Surgical Sciences, University of Turin.
The primary objective is the development and validation of a Computer Vision pipeline for the multi-level analysis of suturing videos from LUMICS, including the recognition of 3 surgical steps, 12 atomic actions, and the segmentation of 2 surgical instruments
Tipo di pubblicazione
URI
![]() |
Modifica (riservato agli operatori) |
