Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training

Open in new window