Learning Mask-aware CLIP Representations for Zero-Shot Segmentation
Siyu Jiao 1,2,3, Yunchao Wei 1,2,3, Yaowei Wang
Neural Information Processing Systems
Recently, pre-trained vision-language models have been increasingly used to tackle the challenging zero-shot segmentation task.
Oct-8-2025, 21:25:20 GMT