Learning Mask-aware CLIP Representations for Zero-Shot Segmentation Siyu Jiao 1,2,3, Y unchao Wei 1,2,3, Y aowei Wang

Neural Information Processing Systems 

Recently, pre-trained vision-language models have been increasingly used to tackle the challenging zero-shot segmentation task.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found