Weakly-Supervised Audio-Visual Segmentation

Neural Information Processing Systems 

Audio-visual segmentation is a challenging task that aims to predict pixel-level masks for sound sources in a video. Previous work applied a comprehensive manually designed architecture with countless pixel-wise accurate masks as supervision. However, these pixel-level masks are expensive and not available in all cases.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found