Encoding Robustness to Image Style via Adversarial Feature Perturbations

Jan-19-2025, 12:10:28 GMT–Neural Information Processing Systems

Adversarial training is the industry standard for producing models that are robust to small adversarial perturbations. However, machine learning practitioners need models that are robust to other kinds of changes that occur naturally, such as changes in the style or illumination of input images. Such changes in input distribution have been effectively modeled as shifts in the mean and variance of deep image features. We adapt adversarial training by directly perturbing feature statistics, rather than image pixels, to produce models that are robust to various unseen distributional shifts. We explore the relationship between these perturbations and distributional shifts by visualizing adversarial features.

adversarial feature perturbation, distributional shift, encoding robustness, (3 more...)

Neural Information Processing Systems

Jan-19-2025, 12:10:28 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology
  - Sensing and Signal Processing > Image Processing (0.63)
  - Artificial Intelligence > Machine Learning (0.62)