Meta-Adapter: An Online Few-shot Learner for Vision-Language Model

Neural Information Processing Systems 

The contrastive vision-language pre-training, known as CLIP, demonstrates remarkable potential in perceiving open-world visual concepts, enabling effective zero-shot image recognition.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found