Low-Loss Subspace Compression for Clean Gains against Multi-Agent Backdoor Attacks
Datta, Siddhartha, Shadbolt, Nigel
arXiv.org Artificial Intelligence
Recent exploration of the multi-agent backdoor attack demonstrated the backfiring effect, a natural defense against backdoor attacks in which backdoored inputs are classified randomly. A side effect is low accuracy w.r.t. clean labels, which motivates this paper's work on constructing multi-agent backdoor defenses that maximize accuracy w.r.t. clean labels and minimize accuracy w.r.t. poison labels. Building on agent dynamics and low-loss subspace construction, we contribute three defenses that yield improved multi-agent backdoor robustness.
Sep-20-2022
- Country:
  - Asia > Nepal (0.04)
  - Europe > United Kingdom
    - England > Oxfordshire > Oxford (0.14)
- Genre:
  - Research Report (0.40)
- Industry:
  - Information Technology > Security & Privacy (0.93)
- Technology: