Think Before You Segment: An Object-aware Reasoning Agent for Referring Audio-Visual Segmentation

Open in new window