DOTA: DistributiOnal Test-time Adaptation of Vision-Language Models
–Neural Information Processing Systems
However, deploying these models can be unreliable when significant distribution gaps exist between training and test data, while fine-tuning for diverse scenarios is often costly. This creates a need for methods that can efficiently adapt to new data at test time without expensive retraining. Cache-based test-time adapters serve this purpose by storing representative test samples to guide subsequent classifications. Yet, these methods typically employ naive cache management with limited capacity, leading to severe catastrophic forgetting when samples are inevitably dropped during updates. In this paper, we propose Dota(DistributiOnal Test-time Adaptation), a simple yet effective method addressing this limitation. Crucially, instead of merely memorizing individual test samples, Dotacontinuously estimates the underlying distribution of the test data stream. Test-time posterior probabilities are then computed using these dynamically estimated distributions via Bayes' theorem for adaptation. This distribution-centric approach enables the model to continually learn and adapt to the deployment environment. Extensive experiments validate that Dota significantly mitigates forgetting and achieves state-of-the-art performance compared to existing methods.
Neural Information Processing Systems
Jun-22-2026, 20:46:26 GMT
- Genre:
- Research Report
- New Finding (1.00)
- Experimental Study (1.00)
- Research Report
- Industry:
- Health & Medicine (0.46)
- Education > Educational Setting (0.46)
- Technology:
- Information Technology
- Sensing and Signal Processing > Image Processing (0.93)
- Artificial Intelligence
- Vision (1.00)
- Natural Language (1.00)
- Representation & Reasoning > Uncertainty
- Bayesian Inference (0.66)
- Machine Learning
- Statistical Learning (0.93)
- Neural Networks (0.68)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.66)
- Information Technology