WATT: Weight Average Test Time Adaptation of CLIP
–Neural Information Processing Systems
Vision-Language Models (VLMs) such as CLIP have yielded unprecedented performances for zero-shot image classification, yet their generalization capability may still be seriously challenged when confronted to domain shifts.
Neural Information Processing Systems
Dec-25-2025, 22:56:59 GMT
- Technology:
- Information Technology > Artificial Intelligence
- Natural Language (0.60)
- Vision (0.60)
- Information Technology > Artificial Intelligence