How to choose your best allies for a transferable attack?
Maho, Thibault, Moosavi-Dezfooli, Seyed-Mohsen, Furon, Teddy
–arXiv.org Artificial Intelligence
The transferability of adversarial examples is a key issue in the security of deep neural networks. The possibility of an adversarial example crafted for a source model fooling another targeted model makes the threat of adversarial attacks more realistic. Measuring transferability is a crucial problem, but the Attack Success Rate alone does not provide a sound evaluation. This paper proposes a new methodology for evaluating transferability by putting distortion in a central position. This new tool shows that transferable attacks may perform far worse than a black box attack if the attacker randomly picks the source model. To address this issue, we propose a new selection mechanism, called FiT, which aims at choosing the best source model with only a few preliminary queries to the target. Our experimental results show that FiT is highly effective at selecting the best source model for multiple scenarios such as single-model attacks, ensemble-model attacks and multiple attacks (Code available at: https://github.com/t-maho/transferability_measure_fit).
arXiv.org Artificial Intelligence
Jul-16-2023
- Country:
- Europe
- United Kingdom > England
- Greater London > London (0.04)
- France > Brittany
- Ille-et-Vilaine > Rennes (0.04)
- United Kingdom > England
- Asia > China
- Tianjin Province > Tianjin (0.04)
- Europe
- Genre:
- Research Report > New Finding (0.66)
- Industry:
- Information Technology > Security & Privacy (1.00)
- Technology: