Cooperative Learning of Audio and Video Models from Self-Supervised Synchronization
Korbar, Bruno, Tran, Du, Torresani, Lorenzo
–Neural Information Processing Systems
There is a natural correlation between the visual and auditive elements of a video. In this work we leverage this connection to learn general and effective models for both audio and video analysis from self-supervised temporal synchronization. We demonstrate that a calibrated curriculum learning scheme, a careful choice of negative examples, and the use of a contrastive loss are critical ingredients to obtain powerful multi-sensory representations from models optimized to discern temporal synchronization of audio-video pairs. Without further fine-tuning, the resulting audio features achieve performance superior or comparable to the state-of-the-art on established audio classification benchmarks (DCASE2014 and ESC-50). At the same time, our visual subnet provides a very effective initialization to improve the accuracy of video-based action recognition models: compared to learning from scratch, our self-supervised pretraining yields a remarkable gain of +19.9% in action recognition accuracy on UCF101 and a boost of +17.7% on HMDB51.
Neural Information Processing Systems
Dec-31-2018
- Country:
- Asia > Taiwan
- Taiwan Province > Taipei (0.04)
- Europe
- Germany > Bavaria
- Upper Bavaria > Munich (0.04)
- Italy > Veneto
- Venice (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Germany > Bavaria
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- United States
- Colorado > El Paso County
- Colorado Springs (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Massachusetts > Suffolk County
- Boston (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Nevada > Clark County
- Las Vegas (0.04)
- New York > New York County
- New York City (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Colorado > El Paso County
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- South America > Chile
- Asia > Taiwan
- Technology: