Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition
