Improved Network Robustness with Adversary Critic

Mar-17-2026, 02:37:56 GMT–Neural Information Processing Systems

Ideally, what confuses neural network should be confusing to humans. However, recent experiments have shown that small, imperceptible perturbations can change the network prediction. To address this gap in perception, we propose a novel approach for learning robust classifier. Our main idea is: adversarial examples for the robust classifier should be indistinguishable from the regular data of the adversarial target. We formulate a problem of learning robust classifier in the framework of Generative Adversarial Networks (GAN), where the adversarial attack on classifier acts as a generator, and the critic network learns to distinguish between regular and adversarial images.

artificial intelligence, machine learning, neurips proceedings improved network robustness, (7 more...)

Neural Information Processing Systems

Mar-17-2026, 02:37:56 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.60)