Investigating the Role of Attribute Context in Vision-Language Models for Object Recognition and Detection

Open in new window