Interpretable Open-Vocabulary Referring Object Detection with Reverse Contrast Attention