Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks

Oct-9-2024, 18:32:29 GMT–Neural Information Processing Systems

Adversarial training is a popular defense strategy against attack threat models with bounded Lp norms. However, it often degrades the model performance on normal images and more importantly, the defense does not generalize well to novel attacks. Given the success of deep generative models such as GANs and VAEs in characterizing the underlying manifold of images, we investigate whether or not the aforementioned deficiencies of adversarial training can be remedied by exploiting the underlying manifold information. To partially answer this question, we consider the scenario when the manifold information of the underlying data is available. We use a subset of ImageNet natural images where an approximate underlying manifold is learned using StyleGAN.

dual manifold adversarial robustness, lp and non-lp adversarial attack, manifold information, (5 more...)

Neural Information Processing Systems

Oct-9-2024, 18:32:29 GMT

Conferences Web Page

Add feedback

Industry:
- Information Technology > Security & Privacy (0.46)
- Government > Military (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.57)