ARDIR: Improving Robustness using Knowledge Distillation of Internal Representation

Takahashi, Tomokatsu, Yamada, Masanori, Yamanaka, Yuuki, Yamashita, Tomoya

Oct-31-2022–arXiv.org Artificial Intelligence

Adversarial training is the most promising method for learning models that are robust against adversarial examples. A recent study has shown that knowledge distillation between models of the same architecture is effective in improving the performance of adversarial training. Exploiting knowledge distillation is a new approach to improving adversarial training and has attracted much attention. However, its performance is still insufficient. Therefore, we propose Adversarial Robust Distillation with Internal Representation (ARDIR) to utilize knowledge distillation even more effectively. In addition to the output of the teacher model, ARDIR uses the internal representation of the teacher model as a label for adversarial training. This enables the student model to be trained with richer, more informative labels. As a result, ARDIR can learn more robust student models. We show that ARDIR outperforms previous methods in our experiments.
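The abstract describes the training signal only in words: the student is fit to both the teacher's output and the teacher's internal representation. A minimal sketch of one way such a combined objective could look is given below. The abstract does not state the actual loss, so every specific here is an assumption made for illustration: the KL-divergence output term, the mean-squared-error representation term, the `feat_weight` and `temperature` parameters, and all function names. Generating the adversarial example is left out.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(z - peak) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) between two discrete distributions."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def distillation_loss(student_logits_adv, teacher_logits,
                      student_feat_adv, teacher_feat,
                      feat_weight=1.0, temperature=1.0):
    """Hypothetical combined loss (not taken from the paper).

    student_*_adv: student outputs/features on an adversarial input.
    teacher_*:     teacher outputs/features used as soft labels.
    feat_weight:   assumed trade-off between the two terms.
    """
    # Output term: match the teacher's softened class distribution.
    output_term = kl_divergence(
        softmax(teacher_logits, temperature),
        softmax(student_logits_adv, temperature),
    )
    # Representation term: match the teacher's internal features.
    feature_term = mse(student_feat_adv, teacher_feat)
    return output_term + feat_weight * feature_term
```

In this sketch the loss is zero when the student reproduces both the teacher's logits and its features, and it grows as either one drifts, which is the sense in which the internal representation acts as an additional label.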

artificial intelligence, machine learning, teacher model


