Goto

Collaborating Authors

 Large Language Model







Uncovering the Hidden Dynamics of Video Self-supervised Learning under Distribution Shifts

Neural Information Processing Systems

Specifically, we pose and answer the following questions: Q1. How do the learned spatial and temporal representations vary based on different VSSL pretrain-ing methodologies? How robust are these representations to different distribution shifts?