MOCA: Self-supervised Representation Learning by Predicting Masked Online Codebook Assignments

Gidaris, Spyros, Bursuc, Andrei, Simeoni, Oriane, Vobecky, Antonin, Komodakis, Nikos, Cord, Matthieu, Pérez, Patrick

Jul-18-2023–arXiv.org Artificial Intelligence

Self-supervised learning can be used for mitigating the greedy needs of Vision Transformer networks for very large fully-annotated datasets. Different classes of self-supervised learning offer representations with either good contextual reasoning properties, e.g., using masked image modeling strategies, or invariance to image perturbations, e.g., with contrastive methods. In this work, we propose a single-stage and standalone method, MOCA, which unifies both desired properties using novel mask-and-predict objectives defined with high-level features (instead of pixel-level details). Moreover, we show how to effectively employ both learning paradigms in a synergistic and computation-efficient way. Doing so, we achieve new state-of-the-art results on low-shot settings and strong experimental results in various evaluation protocols with a training that is at least 3 times faster than prior methods.

artificial intelligence, machine learning, representation, (15 more...)

arXiv.org Artificial Intelligence

Jul-18-2023

arXiv.org PDF

Add feedback

Country:
- Europe > Czechia (0.14)

Genre:
- Research Report (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Neural Networks (0.93)
    - Statistical Learning (1.00)
  - Representation & Reasoning (1.00)
  - Vision (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found