Do Adversarially Robust ImageNet Models Transfer Better? Andrew Ilyas
–Neural Information Processing Systems
Transfer learning is a widely-used paradigm in which models pre-trained on standard datasets can efficiently adapt to downstream tasks. Typically, better pretrained models yield better transfer results, suggesting that initial accuracy is a key aspect of transfer learning performance. In this work, we identify another such aspect: we find that adversarially robust models, while less accurate, often perform better than their standard-trained counterparts when used for transfer learning. Specifically, we focus on adversarially robust ImageNet classifiers, and show that they yield improved accuracy on a standard suite of downstream classification tasks.
Neural Information Processing Systems
May-28-2025, 18:11:53 GMT
- Country:
- North America > United States (0.28)
- Genre:
- Research Report > New Finding (0.49)
- Industry:
- Information Technology (0.46)
- Technology: