WATT: Weight Average Test Time Adaptation of CLIP

Mar-20-2026, 16:37:21 GMT–Neural Information Processing Systems

Vision-Language Models (VLMs) such as CLIP have yielded unprecedented performance for zero-shot image classification, yet their generalization capability may still be seriously challenged when confronted with domain shifts.

artificial intelligence, natural language, proceedings

Technology:
- Information Technology > Artificial Intelligence
  - Vision (0.60)
  - Natural Language (0.60)