Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization

Open in new window