Enhancing End-to-End Multiple Object Tracking with Efficient Propagation Pre-Training and Complementary Long-range Tracklet Re-Identification

Lorenzo Feliziani

Enhancing End-to-End Multiple Object Tracking with Efficient Propagation Pre-Training and Complementary Long-range Tracklet Re-Identification.

Rel. Tatiana Tommasi, Ender Konukoglu, Mattia Segù, Luc Van Gool. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Matematica, 2025

Preview

PDF (Tesi_di_laurea) - Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (26MB) | Preview

Abstract

Tracking-by-propagation has emerged as a promising paradigm in Multi-Object Tracking (MOT), leveraging feature propagation across frames to maintain object identities within an end-to-end architecture. Existing approaches, such as SambMOTR, have demonstrated impressive performance, often rivaling or surpassing tracking-by-detection methods, all while utilizing a single fully trainable architecture based on DETR-like detectors. However, their effectiveness is limited by several important challenges. Jointly training a strong pre-trained detector and a propagation module from scratch often disrupts the detector's initialization, resulting in performance degradation. Additionally, tracking-by-propagation struggles to maintain tracklet consistency during long occlusions, leading to identity switches and deteriorating tracking performance. To address these limitations, we propose SambaMOTRv2, a novel framework that enhances tracking-by-propagation performance by integrating within SambaMOTR a memory- and time-efficient pre-training strategy for the propagating parts of the algorithm and a Samba-based re-identification system.

Our pre-training approach alleviates the initialization issue by ensuring strong training of the propagation module, enabling end-to-end tracking-by-propagation to surpass the performance of the baseline method after joint training