Revisiting Adversarial Training for ImageNet: Architectures, Training and Generalization across Threat Models
–Neural Information Processing Systems
While adversarial training has been extensively studied for ResNet architectures and low resolution datasets like CIFAR-10, much less is known for ImageNet. Given the recent debate about whether transformers are more robust than convnets, we revisit adversarial training on ImageNet comparing ViTs and ConvNeXts. Extensive experiments show that minor changes in architecture, most notably replacing PatchStem with ConvStem, and training scheme have a significant impact on the achieved robustness.
Neural Information Processing Systems
May-28-2025, 18:33:15 GMT
- Country:
- Europe > Germany > Baden-Württemberg > Tübingen Region > Tübingen (0.14)
- Genre:
- Research Report (0.46)
- Technology: