Reverse Engineering Self-Supervised Learning

Jan-19-2025, 19:59:46 GMT–Neural Information Processing Systems

Understanding the learned representation and underlying mechanisms of Self-Supervised Learning (SSL) often poses a challenge. In this paper, we 'reverse engineer' SSL, conducting an in-depth empirical analysis of its learned internal representations, encompassing diverse models, architectures, and hyperparameters. Our study reveals an intriguing process within SSL training: an inherent facilitation of semantic label-based clustering, which is surprisingly driven by the regularization component of the SSL objective. This clustering not only enhances downstream classification, but also compresses the information. We further illustrate that the SSL-trained representation aligns more strongly with semantic classes than with random functions.
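As a rough illustration of what "alignment with semantic classes rather than random functions" could mean in practice, the sketch below scores a set of embeddings with a nearest-class-center (NCC) classifier, once against class labels and once against random labels. The choice of NCC accuracy as the measure, the function name `ncc_accuracy`, and the synthetic Gaussian "embeddings" are all assumptions made for illustration; the abstract does not specify the measurement, and no SSL model is trained here.

```python
import numpy as np

def ncc_accuracy(train_z, train_y, test_z, test_y):
    """Nearest-class-center accuracy: assign each test embedding to the
    label whose training-set mean embedding is closest (Euclidean)."""
    classes = np.unique(train_y)
    # One center per label: the mean of the training embeddings with that label.
    centers = np.stack([train_z[train_y == c].mean(axis=0) for c in classes])
    # Pairwise distances, shape (n_test, n_classes).
    dists = np.linalg.norm(test_z[:, None, :] - centers[None, :, :], axis=-1)
    pred = classes[dists.argmin(axis=1)]
    return float((pred == test_y).mean())

rng = np.random.default_rng(0)

# Synthetic stand-in for SSL embeddings (assumption): 3 well-separated
# Gaussian clusters in 16 dimensions, 100 points per cluster.
n_classes, n_per_class, dim = 3, 100, 16
means = 5.0 * rng.normal(size=(n_classes, dim))
y_semantic = np.repeat(np.arange(n_classes), n_per_class)
z = means[y_semantic] + rng.normal(size=(n_classes * n_per_class, dim))

# Random labels over the same embeddings act as the "random function" baseline.
y_random = rng.integers(0, n_classes, size=len(z))

# Simple train/test split.
perm = rng.permutation(len(z))
train, test = perm[:200], perm[200:]

acc_semantic = ncc_accuracy(z[train], y_semantic[train], z[test], y_semantic[test])
acc_random = ncc_accuracy(z[train], y_random[train], z[test], y_random[test])

print(f"NCC accuracy, semantic labels: {acc_semantic:.2f}")
print(f"NCC accuracy, random labels:   {acc_random:.2f}")
```

On embeddings that cluster by class, the semantic-label score should sit well above the random-label score, which stays near chance. With real models, `z` would be replaced by features extracted from a trained encoder.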

engineering self-supervised learning, representation, self-supervised learning, (2 more...)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (0.71)
  - Machine Learning > Inductive Learning (0.66)