Masked Autoencoders As Spatiotemporal Learners

Open in new window