Enhancing End-to-End Multiple Object Tracking with Efficient Propagation Pre-Training and Complementary Long-range Tracklet Re-Identification

Lorenzo Feliziani

Enhancing End-to-End Multiple Object Tracking with Efficient Propagation Pre-Training and Complementary Long-range Tracklet Re-Identification.

Rel. Tatiana Tommasi, Ender Konukoglu, Mattia Segù, Luc Van Gool. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Matematica, 2025

PDF (Tesi_di_laurea) - Tesi
Accesso riservato a: Solo utenti staff fino al 14 Marzo 2026 (data di embargo).
Licenza: Creative Commons Attribution Non-commercial No Derivatives.
Download (26MB)

Abstract:	Tracking-by-propagation has emerged as a promising paradigm in Multi-Object Tracking (MOT), leveraging feature propagation across frames to maintain object identities within an end-to-end architecture. Existing approaches, such as SambMOTR, have demonstrated impressive performance, often rivaling or surpassing tracking-by-detection methods, all while utilizing a single fully trainable architecture based on DETR-like detectors. However, their effectiveness is limited by several important challenges. Jointly training a strong pre-trained detector and a propagation module from scratch often disrupts the detector's initialization, resulting in performance degradation. Additionally, tracking-by-propagation struggles to maintain tracklet consistency during long occlusions, leading to identity switches and deteriorating tracking performance. To address these limitations, we propose SambaMOTRv2, a novel framework that enhances tracking-by-propagation performance by integrating within SambaMOTR a memory- and time-efficient pre-training strategy for the propagating parts of the algorithm and a Samba-based re-identification system. Our pre-training approach alleviates the initialization issue by ensuring strong training of the propagation module, enabling end-to-end tracking-by-propagation to surpass the performance of the baseline method after joint training. The introduction of a long-range tracklet re-identification module allows the model to merge fragmented tracklets across prolonged occlusions, recovering tracklets from occlusions up to 300% longer than those in the previous method. Unlike prior approaches, our re-identification module performs online tracklet re-identification and merging, trained using video-level positive and negative pairs, rather than keyframe reference-frame pairs. Experimental results show that the integration of the implemented methods leads to a significant performance boost, allowing our model to outperform previous state-of-the-art methods on complex datasets such as DanceTrack and SportsMOT. These results highlight the effectiveness of our approach in addressing key challenges in tracking-by-propagation and substantially improving both tracking accuracy and identity preservation.
Relatori:	Tatiana Tommasi, Ender Konukoglu, Mattia Segù, Luc Van Gool
Anno accademico:	2024/25
Tipo di pubblicazione:	Elettronica
Numero di pagine:	72
Soggetti:
Corso di laurea:	Corso di laurea magistrale in Ingegneria Matematica
Classe di laurea:	Nuovo ordinamento > Laurea magistrale > LM-44 - MODELLISTICA MATEMATICO-FISICA PER L'INGEGNERIA
Ente in cotutela:	ETH Zurich (SVIZZERA)
Aziende collaboratrici:	ETH Zurich
URI:	http://webthesis.biblio.polito.it/id/eprint/34734

Modifica (riservato agli operatori)