Generative Poisoning Using Random Discriminators
van Vlijmen, Dirren, Kolmus, Alex, Liu, Zhuoran, Zhao, Zhengyu, Larson, Martha
–arXiv.org Artificial Intelligence
We introduce ShortcutGen, a new data poisoning attack that generates sample-dependent, error-minimizing perturbations by learning a generator. The key novelty of ShortcutGen is the use of a randomly-initialized discriminator, which provides spurious shortcuts needed for generating poisons. Different from recent, iterative methods, our ShortcutGen can generate perturbations with only one forward pass in a label-free manner, and compared to the only existing generative method, DeepConfuse, our ShortcutGen is faster and simpler to train while remaining competitive. We also demonstrate that integrating a simple augmentation strategy can further boost the robustness of ShortcutGen against early stopping, and combining augmentation and non-augmentation leads to new state-of-the-art results in terms of final validation accuracy, especially in the challenging, transfer scenario. Lastly, we speculate, through uncovering its working mechanism, that learning a more general representation space could allow ShortcutGen to work for unseen data.
arXiv.org Artificial Intelligence
Nov-2-2022
- Country:
- Asia > China (0.05)
- Europe
- Germany > Saarland
- Saarbrücken (0.04)
- Netherlands > Gelderland
- Nijmegen (0.04)
- Germany > Saarland
- North America > Canada
- Genre:
- Research Report (0.50)
- Technology: