Grounding Language Models for Visual Entity Recognition

Open in new window