Mitigating Backdoor Attack by Injecting Proactive Defensive Backdoor

Shaokui Wei, Baoyuan Wu

May-31-2025, 11:31:59 GMT–Neural Information Processing Systems

Data-poisoning backdoor attacks are a serious security threat to machine learning models: an adversary manipulates the training dataset to inject backdoors into the models trained on it. In this paper, we focus on in-training backdoor defense, which aims to train a clean model even when the dataset may be poisoned. Unlike most existing methods, which primarily detect and then remove or unlearn suspicious samples to mitigate backdoor attacks, we propose a novel defense approach called PDB (Proactive Defensive Backdoor).

artificial intelligence, backdoor attack, machine learning

Country:
- Asia > China (0.28)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (1.00)

Industry:
- Information Technology > Security & Privacy (1.00)

Technology:
- Information Technology
  - Artificial Intelligence
    - Machine Learning > Neural Networks
      - Deep Learning (0.46)
    - Vision (1.00)
  - Security & Privacy (1.00)
  - Sensing and Signal Processing > Image Processing (1.00)