Zero-shot Visual Relation Detection via Composite Visual Cues from Large Language Models Lin Li

Open in new window