b618c3210e934362ac261db280128c22-Paper.pdf

Feb-10-2025, 09:30:55 GMT–Neural Information Processing Systems

Test-time training (TTT) through self-supervised learning (SSL) is an emerging paradigm to tackle distributional shifts. Despite encouraging results, it remains unclear when this approach thrives or fails. In this work, we first provide an indepth look at its limitations and show that TTT can possibly deteriorate, instead of improving, the test-time performance in the presence of severe distribution shifts. To address this issue, we introduce a test-time feature alignment strategy utilizing offline feature summarization and online moment matching, which regularizes adaptation without revisiting training data. We further scale this strategy in the online setting through batch-queue decoupling to enable robust moment estimates even with limited batch size. Given aligned feature distributions, we then shed light on the strong potential of TTT by theoretically analyzing its performance post adaptation.

adaptation, artificial intelligence, machine learning, (13 more...)

Neural Information Processing Systems

Feb-10-2025, 09:30:55 GMT

Conferences PDF

Add feedback

Genre:
- Research Report > New Finding (0.68)