Interpretable Open-Vocabulary Referring Object Detection with Reverse Contrast Attention
