Mask2KAN: A Universal Image Segmentation Kolmogorov–Arnold Network Architecture
Gianluca Guzzetta
Mask2KAN: A Universal Image Segmentation Kolmogorov–Arnold Network Architecture.
Rel. Carlo Masone, Shyam Nandan Rai. Politecnico di Torino, Corso di laurea magistrale in Data Science And Engineering, 2024
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (8MB) | Preview |
|
|
Archive (ZIP) (Documenti_allegati)
- Altro
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (84MB) |
Abstract
Universal architectures like Mask2Former have redefined the way we approach image segmentation tasks. Traditionally, specialized architectures were used for specific tasks such as semantic, instance, and panoptic segmentation. Now, a single, unified architecture can outperform these task-specific models, offering benefits in performance, efficiency, and effort, while also reshaping the way we perceive these tasks. In this paper, experiments are conducted using the Mask2Former configuration for \textit{semantic segmentation}. However, similar to other universal models like DETR, these architectures, despite sharing the same underlying structure, \textit{are still trained separately for different tasks and datasets}. Recent works on the passage from the Universal Approximation Theorem to a Kolmogorov-Arnold theorem inspired the present work to delve in Kolmogorov Arnold Network on computer vision tasks.
Traditional semantic segmentation models as Mask2Former, recognize a predefined set of classes, often failing to detect unseen objects (anomalies)
Tipo di pubblicazione
URI
![]() |
Modifica (riservato agli operatori) |
