Do Adversarially Robust ImageNet Models Transfer Better?

Dec-23-2025, 20:58:37 GMT – Neural Information Processing Systems

Transfer learning is a widely used paradigm in deep learning, where models pre-trained on standard datasets can be efficiently adapted to downstream tasks. Typically, better pre-trained models yield better transfer results, suggesting that initial accuracy is a key aspect of transfer learning performance. In this work, we identify another such aspect: we find that adversarially robust models, while less accurate, often perform better than their standard-trained counterparts when used for transfer learning. Specifically, we focus on adversarially robust ImageNet classifiers, and show that they yield improved accuracy on a standard suite of downstream classification tasks.
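
For context, the abstract does not define "adversarially robust". A common formulation, assumed here rather than taken from the text above, is adversarial training: instead of minimizing the expected loss on clean inputs, the model minimizes the worst-case loss over small perturbations of each input. The symbols below (parameters \theta, loss \mathcal{L}, data distribution \mathcal{D}, perturbation \delta, budget \varepsilon, and the choice of the \ell_2 norm) are illustrative assumptions.

```latex
% Illustrative sketch; notation and the l2 norm are assumptions, not taken from the abstract.
\begin{align}
\text{standard training:} \quad
  & \min_{\theta} \; \mathbb{E}_{(x,y)\sim\mathcal{D}}
    \big[ \mathcal{L}(x, y; \theta) \big] \\
\text{adversarial training:} \quad
  & \min_{\theta} \; \mathbb{E}_{(x,y)\sim\mathcal{D}}
    \Big[ \max_{\|\delta\|_2 \le \varepsilon} \mathcal{L}(x + \delta, y; \theta) \Big]
\end{align}
```

Under this formulation, setting \varepsilon = 0 recovers standard training, so the robust and standard-trained models being compared would differ only in the perturbation budget used during pre-training.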

Genre:
- Research Report > New Finding (0.41)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (1.00)