| International Journal of Computer Applications |
| Foundation of Computer Science (FCS), NY, USA |
| Volume 187 - Number 62 |
| Year of Publication: 2025 |
| Authors: Mahmoud Khalil, Ahmad Khalil, Alioune Ngom |
10.5120/ijca2025926002
|
Mahmoud Khalil, Ahmad Khalil, Alioune Ngom . Representation Learning with Adaptive Superpixel Coding. International Journal of Computer Applications. 187, 62 ( Dec 2025), 1-17. DOI=10.5120/ijca2025926002
Deep learning vision models are typically tailored for specific modalities and often rely on domain-specific assumptions, such as the grid structures used by most existing architectures. This paper introduces a self-supervised Transformer-based model called Adaptive Superpixel Coding (ASC). The key idea behind the approach is to address the limitations of traditional Vision Transformers, which depend on fixed-size and non-adaptive patch partitioning. Instead, ASC employs adaptive superpixel layers that dynamically adjust to the underlying image content. The study analyzes the properties that make the proposed method effective and demonstrates that the approach outperforms widely used baselines on standard image downstream task benchmarks.