On the Impact of Knowledge Distillation for Model Interpretability

Han, Hyeongrok, Kim, Siwon, Choi, Hyun-Soo, Yoon, Sungroh

May-25-2023–arXiv.org Artificial Intelligence

Several recent studies have elucidated why knowledge distillation (KD) improves model performance. However, few have investigated the advantages of KD beyond improved performance. In this study, we attempt to show that KD enhances the interpretability as well as the accuracy of models. For a quantitative comparison of model interpretability, we measured the number of concept detectors identified by network dissection. We attribute the improvement in interpretability to the class-similarity information transferred from the teacher to the student model. First, we confirmed that class-similarity information is transferred from the teacher to the student model via logit distillation. Then, we analyzed how class-similarity information affects model interpretability, in terms of both its presence or absence and the degree of similarity. We conducted various quantitative and qualitative experiments and examined the results across different datasets, different KD methods, and different measures of interpretability. Our results suggest that models distilled from large teacher models can be used more reliably in various fields.
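
As a rough illustration of the logit distillation mentioned in the abstract, the following minimal sketch computes a temperature-softened distillation loss for a single example. It assumes the commonly used formulation (KL divergence between softened teacher and student distributions, mixed with cross-entropy on the hard label); the abstract does not specify the exact loss, and the function names, `temperature=4.0`, and `alpha=0.9` are illustrative choices, not values taken from the paper.

```python
import math


def log_softmax(logits, temperature=1.0):
    """Return log-probabilities of a temperature-scaled softmax."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(s - m) for s in scaled))
    return [s - log_sum for s in scaled]


def kd_loss(student_logits, teacher_logits, label, temperature=4.0, alpha=0.9):
    """Hypothetical logit-distillation loss for one example.

    alpha weights the soft (teacher) term; (1 - alpha) weights the
    hard-label cross-entropy. Both defaults are assumptions.
    """
    log_pt = log_softmax(teacher_logits, temperature)
    log_ps = log_softmax(student_logits, temperature)
    # KL(teacher || student) on the softened distributions; the softened
    # teacher distribution is where class-similarity information lives.
    kl = sum(math.exp(lt) * (lt - ls) for lt, ls in zip(log_pt, log_ps))
    # Standard cross-entropy against the ground-truth label (temperature 1).
    ce = -log_softmax(student_logits)[label]
    # The temperature**2 factor keeps the soft-term gradient scale
    # comparable across temperatures.
    return alpha * temperature ** 2 * kl + (1.0 - alpha) * ce


# Example: a teacher that considers classes 0 and 1 similar.
teacher = [5.0, 4.0, -2.0]
student = [2.0, 0.5, 0.0]
loss = kd_loss(student, teacher, label=0)
```

With a higher temperature, the teacher's non-target probabilities become more prominent, so the student is pushed to reproduce which classes the teacher regards as similar rather than only the top prediction.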

data mining, machine learning, natural language

Country:
- North America > United States
  - Hawaii > Honolulu County > Honolulu (0.04)
- Asia > South Korea
  - Seoul > Seoul (0.05)
  - Gangwon-do > Chuncheon (0.04)

Genre:
- Research Report > New Finding (1.00)

Industry:
- Education (1.00)

Technology:
- Information Technology
  - Data Science > Data Mining (0.93)
  - Sensing and Signal Processing > Image Processing (0.67)
  - Artificial Intelligence
    - Representation & Reasoning (1.00)
    - Natural Language > Explanation & Argumentation (0.93)
    - Vision (0.68)
    - Machine Learning
      - Neural Networks > Deep Learning (0.94)
      - Performance Analysis (0.67)
