Clarification as Supervision: Reinforcement Learning for Vision-Language Interfaces

Open in new window