Robust Egocentric Referring Video Object Segmentation via Dual-Modal Causal Intervention

Open in new window