Boosting the Transferability of Adversarial Attack on Vision Transformer with Adaptive Token Tuning

Neural Information Processing Systems 

Extensive experiments conducted on ViTs, undefended CNNs, and defended CNNs validate the superiority of our proposed ATT attack method. On average, our approach improves the attack performance by 10.1%.