Object-centric binding in Contrastive Language-Image Pretraining

Open in new window