Enhancing Self-Supervised Learning with Semantic Pairs A New Dataset and Empirical Study
Alkhalefi, Mohammad, Leontidis, Georgios, Zhong, Mingjun
–arXiv.org Artificial Intelligence
Instance discrimination is a self-supervised representation learning paradigm wherein individual instances within a dataset are treated as distinct classes. This is typically achieved by generating two disparate views of each instance by applying stochastic transformations, which encourages the model to learn representations that are invariant to the common underlying object across these views. While this approach facilitates the acquisition of invariant representations for dataset instances under various handcrafted transformations (e.g., random cropping, color jittering), an exclusive reliance on such data transformations for achieving invariance may inherently limit the model's generalization to unseen datasets and diverse downstream tasks. The inherent limitation stems from the fact that the finite set of transformations within the data processing pipeline is unable to encompass the full spectrum of potential data variations. In this study, we provide the technical foundation for leveraging semantic pairs to enhance the generalization of the model's representation and empirically demonstrate that incorporating semantic pairs mitigates the issue of limited transformation coverage. Specifically, we propose that exposing the model to semantic pairs (i.e., two instances belonging to the same semantic category) introduces varied real-world scene contexts, thereby fostering the development of more generalizable object representations. To validate this hypothesis, we constructed and released a novel dataset comprising curated semantic pairs and conducted extensive experimentation to empirically establish that their inclusion enables the model to learn more general representations, ultimately leading to improved performance across diverse downstream tasks.
arXiv.org Artificial Intelligence
Oct-14-2025
- Country:
- Europe > United Kingdom (0.14)
- Oceania > New Zealand
- South Island > Marlborough District > Blenheim (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Inductive Learning (0.84)
- Neural Networks (0.67)
- Natural Language (1.00)
- Representation & Reasoning (1.00)
- Vision (1.00)
- Machine Learning
- Information Technology > Artificial Intelligence