Cross-Architectural Positive Pairs improve the effectiveness of Self-Supervised Learning
Singh, Pranav, Cirrone, Jacopo
–arXiv.org Artificial Intelligence
Existing self-supervised techniques have extreme computational requirements and suffer a substantial drop in performance with a reduction in batch size or pretraining epochs. This paper presents Cross Architectural - Self Supervision (CASS), a novel self-supervised learning approach that leverages Transformer and CNN simultaneously. Compared to the existing state-of-the-art self-supervised learning approaches, we empirically show that CASS-trained CNNs and Transformers across four diverse datasets gained an average of 3.8% with 1% labeled data, 5.9% with 10% labeled data, and 10.13% with 100% labeled data while taking 69% less time. We also show that CASS is much more robust to changes in batch size and training epochs than existing state-of-the-art self-supervised learning approaches. We have open-sourced our code at https://github.com/pranavsinghps1/CASS.
arXiv.org Artificial Intelligence
Jan-27-2023
- Country:
- North America > United States
- New York > New York County > New York City (0.04)
- Asia > Middle East
- Israel > Tel Aviv District > Tel Aviv (0.04)
- North America > United States
- Genre:
- Research Report > New Finding (0.93)
- Industry:
- Health & Medicine
- Diagnostic Medicine (0.69)
- Therapeutic Area
- Immunology (0.94)
- Dermatology (0.93)
- Oncology (0.68)
- Health & Medicine
- Technology: