A Closer Look at Weakly-Supervised Audio-Visual Source Localization

Shentong Mo (Carnegie Mellon University)
Pedro Morgado (University of Wisconsin-Madison)

Neural Information Processing Systems 

Audio-visual source localization is a challenging task that aims to predict the location of visual sound sources in a video.