Intra-Modal Proxy Learning for Zero-Shot Visual Categorization with CLIP

Open in new window