ILLUME: Rationalizing Vision-Language Models through Human Interactions
