Improving the Transferability of Adversarial Examples by Inverse Knowledge Distillation

Wu, Wenyuan, Liu, Zheng, Chen, Yong, Su, Chao, Peng, Dezhong, Wang, Xu

Feb-24-2025 – arXiv.org Artificial Intelligence

In recent years, the rapid development of deep neural networks has brought increased attention to the security and robustness of these models. While existing adversarial attack algorithms have demonstrated success in improving adversarial transferability, their performance remains suboptimal due to a lack of consideration for the discrepancies between target and source models. To address this limitation, we propose a novel method, Inverse Knowledge Distillation (IKD), designed to enhance adversarial transferability effectively. IKD introduces a distillation-inspired loss function that seamlessly integrates with gradient-based attack methods, promoting diversity in attack gradients and mitigating overfitting to specific model architectures. By diversifying gradients, IKD enables the generation of adversarial samples with superior generalization capabilities across different models, significantly enhancing their effectiveness in black-box attack scenarios.
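The abstract does not give the IKD loss itself, so the following is only a minimal sketch of the general idea of adding a distillation-style term to a gradient-based attack, not the authors' formulation. Everything in it is an assumption made for illustration: the toy linear softmax "source model" (`W`, `b`), the temperature `temp`, the weight `lam`, and the choice of a temperature-softened KL divergence between the clean and adversarial predictions as the extra term. The attack loop is a plain iterative sign-gradient (I-FGSM style) update with an L-infinity budget `eps`.

```python
import numpy as np

def softmax(z):
    # Numerically stable softmax over a 1-D logit vector.
    z = z - z.max()
    e = np.exp(z)
    return e / e.sum()

def objective_and_grad(W, b, x, y, q_clean, temp, lam):
    """Loss and input gradient for CE(x, y) + lam * KL(q_clean || p_temp(x)).

    The KL term is a hypothetical distillation-style penalty, NOT the
    loss from the paper: it rewards moving the softened adversarial
    prediction away from the softened clean prediction q_clean.
    """
    logits = W @ x + b
    p = softmax(logits)            # ordinary prediction
    p_t = softmax(logits / temp)   # temperature-softened prediction
    onehot = np.zeros_like(p)
    onehot[y] = 1.0
    ce = -np.log(p[y])
    kl = np.sum(q_clean * (np.log(q_clean) - np.log(p_t)))
    # Analytic gradients for a linear model:
    #   d CE / dx = W^T (p - onehot)
    #   d KL / dx = W^T (p_t - q_clean) / temp
    grad = W.T @ (p - onehot) + lam * (W.T @ (p_t - q_clean)) / temp
    return ce + lam * kl, grad

def attack(W, b, x, y, eps=0.1, alpha=0.02, steps=10, temp=2.0, lam=1.0):
    # Softened clean prediction, held fixed during the attack.
    q_clean = softmax((W @ x + b) / temp)
    x_adv = x.copy()
    for _ in range(steps):
        _, g = objective_and_grad(W, b, x_adv, y, q_clean, temp, lam)
        x_adv = x_adv + alpha * np.sign(g)        # ascend the combined loss
        x_adv = np.clip(x_adv, x - eps, x + eps)  # stay inside the eps-ball
    return x_adv

# Toy setup: 5 input features, 3 classes, random weights (illustrative only).
rng = np.random.default_rng(0)
W = rng.normal(size=(3, 5))
b = rng.normal(size=3)
x = rng.normal(size=5)
y = int(np.argmax(W @ x + b))  # treat the clean prediction as the label
x_adv = attack(W, b, x, y)
```

In a realistic setting the linear model would be replaced by a deep source network with automatic differentiation, and transferability would be measured by feeding `x_adv` to separate black-box target models; neither step is shown here.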

adversarial attack, adversarial sample, transferability

Country:
- Asia > China (0.04)
- Europe > Netherlands
  - North Holland > Amsterdam (0.04)

Genre:
- Research Report > Promising Solution (0.34)

Industry:
- Information Technology > Security & Privacy (1.00)
- Government (1.00)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (0.88)
