Adversarial Distributional Training for Robust Deep Learning

Oct-10-2024, 07:58:01 GMT–Neural Information Processing Systems

Adversarial training (AT) is among the most effective techniques for improving model robustness; it augments the training data with adversarial examples. However, most existing AT methods adopt a specific attack to craft adversarial examples, which leads to unreliable robustness against other, unseen attacks. Moreover, a single attack algorithm may be insufficient to explore the space of perturbations. In this paper, we introduce adversarial distributional training (ADT), a novel framework for learning robust models. ADT is formulated as a minimax optimization problem: the inner maximization learns an adversarial distribution that characterizes the potential adversarial examples around a natural one under an entropic regularizer, and the outer minimization trains robust models by minimizing the expected loss over the worst-case adversarial distributions.
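
The minimax problem described above can be sketched in symbols. The notation below is an assumption introduced for illustration and does not appear in the abstract: f_θ is the model with parameters θ, (x_i, y_i) are the n training pairs, L is the training loss, p(δ_i) is the adversarial distribution over perturbations δ_i of x_i, P is the set of admissible distributions (for example, those supported on a norm ball of radius ε), H is the entropy, and λ ≥ 0 weights the entropic regularizer.

```latex
\min_{\theta} \; \frac{1}{n} \sum_{i=1}^{n}
  \max_{p(\delta_i) \in \mathcal{P}}
  \Big\{
    \mathbb{E}_{p(\delta_i)}\!\big[ \mathcal{L}\big(f_{\theta}(x_i + \delta_i),\, y_i\big) \big]
    + \lambda \, \mathcal{H}\big(p(\delta_i)\big)
  \Big\}
```

Under these assumptions, setting λ = 0 lets the inner maximum be attained by a point mass on a single worst-case perturbation, which reduces the objective to the usual AT form; a positive λ rewards distributions that spread over many perturbations.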

adversarial distributional training, adversarial example, robust deep learning

