Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts Ahmad Beirami

Feb-11-2025, 07:47:31 GMT–Neural Information Processing Systems

Video self-supervised learning (VSSL) has made significant progress in recent years. However, the exact behavior and dynamics of these models under different forms of distribution shift are not yet known. In this paper, we comprehensively study the behavior of six popular self-supervised methods (v-SimCLR, v-MoCo, v-BYOL, v-SimSiam, v-DINO, v-MAE) in response to various forms of natural distribution shift, i.e., (i) context shift, (ii) viewpoint shift, (iii) actor shift, (iv) source shift, (v) generalizability to unknown classes (zero-shot), and (vi) open-set recognition. To perform this extensive study, we carefully craft a test bed consisting of 17 in-distribution and out-of-distribution benchmark pairs using available public datasets and a series of evaluation protocols to stress-test the different methods under the intended shifts.

large language model, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Feb-11-2025, 07:47:31 GMT

Conferences PDF

Add feedback

Country:
- North America (0.27)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Health & Medicine > Consumer Health (1.00)
- Leisure & Entertainment > Sports
  - Track & Field (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning
    - Inductive Learning (0.70)
    - Neural Networks > Deep Learning (0.67)
    - Statistical Learning (0.92)
  - Natural Language > Large Language Model (0.89)
  - Representation & Reasoning (1.00)
  - Vision (1.00)