Friendly Noise against Adversarial Noise: A Powerful Defense against Data Poisoning Attack

Oct-11-2024, 00:07:05 GMT–Neural Information Processing Systems

A powerful category of (invisible) data poisoning attacks modify a subset of training examples by small adversarial perturbations to change the prediction of certain test-time data. Existing defense mechanisms are not desirable to deploy in practice, as they ofteneither drastically harm the generalization performance, or are attack-specific, and prohibitively slow to apply. Here, we propose a simple but highly effective approach that unlike existing methods breaks various types of invisible poisoning attacks with the slightest drop in the generalization performance. We make the key observation that attacks introduce local sharp regions of high training loss, which when minimized, results in learning the adversarial perturbations and makes the attack successful. To break poisoning attacks, our key idea is to alleviate the sharp loss regions introduced by poisons.

adversarial noise, data poisoning attack, noise, (4 more...)

Neural Information Processing Systems

Oct-11-2024, 00:07:05 GMT

Conferences Web Page

Add feedback

Genre:
- Play > Prospect (0.66)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Inductive Learning (0.62)