Extending Video Masked Autoencoders to 128 frames

Open in new window