Text-Guided Attention is All You Need for Zero-Shot Robustness in Vision-Language Models Lu Yu

Neural Information Processing Systems 

Vision-language models (e.g., CLIP) have attracted widespread attention and adoption across various domains. Nonetheless, CLIP has been observed to be susceptible to adversarial examples.
