Improved Visual Grounding through Self-Consistent Explanations
