A Simple Recipe for Contrastively Pre-training Video-First Encoders Beyond 16 Frames