Test-Time Prompt Tuning for Zero-Shot Generalization in Vision-Language Models

Neural Information Processing Systems 

Pre-trained vision-language models (e.g., CLIP) have shown promising zero-shot

Similar Docs  Excel Report  more

TitleSimilaritySource
None found