Weakly-Supervised Audio-Visual Segmentation

Oct-8-2025, 11:03:59 GMT–Neural Information Processing Systems

Audio-visual segmentation is a challenging task that aims to predict pixel-level masks for sound sources in a video. Previous work applied a comprehensive manually designed architecture with countless pixel-wise accurate masks as supervision. However, these pixel-level masks are expensive and not available in all cases.

computer vision, proceedings, segmentation, (11 more...)

Neural Information Processing Systems

Oct-8-2025, 11:03:59 GMT

Conferences PDF

Add feedback

Country:
- Asia > China (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Vision (1.00)
  - Natural Language (1.00)
  - Machine Learning (1.00)

Duplicate Docs Excel Report

Title
377b2e39e97e917b9e625b35241e33df-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found