Analyzing the Confidentiality of Undistillable Teachers in Knowledge Distillation

Oct-10-2024, 07:11:14 GMT–Neural Information Processing Systems

Knowledge distillation (KD) has recently been identified as a method that can unintentionally leak private information regarding the details of a teacher model to an unauthorized student. Recent research in developing undistillable nasty teachers that can protect model confidentiality has gained significant attention. However, the level of protection these nasty models offer has been largely untested. In this paper, we show that transferring knowledge to a shallow sub-section of a student can largely reduce a teacher's influence. By exploring the depth of the shallow subsection, we then present a distillation technique that enables a skeptical student model to learn even from a nasty teacher.

knowledge distillation, nasty teacher, student, (6 more...)

Neural Information Processing Systems

Oct-10-2024, 07:11:14 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.61)