Learning Representations from Audio-Visual Spatial Alignment

Dec-23-2025, 22:17:47 GMT–Neural Information Processing Systems

We introduce a novel self-supervised pretext task for learning representations from audio-visual content.

artificial intelligence, proceedings, representation, (9 more...)

Neural Information Processing Systems

Dec-23-2025, 22:17:47 GMT

Conferences Web Page

Technology:
- Information Technology > Artificial Intelligence (0.43)