Ewen Aymerick Youenn Rondel
Spatial-Temporal Consistency Enhanced Segmentation for Laparoscopic Surgical Videos.
Rel. Alessio Sacco, Guido Marchetto, Flavio Esposito. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2026
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (6MB) | Preview |
Abstract
Accurate multi-organ semantic segmentation in laparoscopic surgery videos is essential for computer-assisted interventions, enabling organ recognition, instrument tracking, and context-aware guidance. However, surgical scenes exhibit several intrinsic challenges: large non-rigid organ deformations, occlusions caused by instruments or blood, rapid viewpoint changes, motion blur, and heterogeneous lightning. These factors undermine the temporal stability and boundary precision of conventional single-frame segmentation networks. To overcome these limitations, this thesis proposes SSTC-Seg, a deformable memory-based multi-scale architecture specifically designed to enforce spatial adaptivity and temporal coherence in minimally invasive surgical environments. SSTC-Seg integrates three complementary components into a unified, end-to-end trainable architecture. First, a deformable multi-scale encoder —combining deformable convolutions, multi-scale feature extraction, and lightweight self-attention— adapts receptive fields to non-rigid anatomical structures while capturing both fine-grained and global contextual cues.
Second, a memory-based attention mechanism aggregates information from a memory bank of past frames embeddings through stacked self-attention and cross-attention blocks
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
URI
![]() |
Modifica (riservato agli operatori) |
