Discovering and Overcoming Limitations of Noise-engineered Data-free Knowledge Distillation

Apr-25-2026, 00:49:11 GMT–Neural Information Processing Systems

Distillation in neural networks using only the samples randomly drawn from a Gaussian distribution is possibly the most straightforward solution one can think of for the complex problem of knowledge transfer from one network (teacher) to the other (student). If successfully done, it can eliminate the requirement of teacher's training data for knowledge distillation and avoid often arising privacy concerns in sensitive applications such as healthcare. There have been some recent attempts at Gaussian noise-based data-free knowledge distillation, however, none of them offer a consistent or reliable solution. We identify the shift in the distribution of hidden layer activation as the key limiting factor, which occurs when Gaussian noise is fed to the teacher network instead of the accustomed training data. We propose a simple solution to mitigate this shift and show that for vision tasks, such as classification, it is possible to achieve a performance close to the teacher by just using the samples randomly drawn from a Gaussian distribution.

artificial intelligence, gaussian noise, machine learning, (14 more...)

Neural Information Processing Systems

Apr-25-2026, 00:49:11 GMT

Conferences PDF

Add feedback

Industry:
- Education (0.94)
- Information Technology > Security & Privacy (0.66)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Duplicate Docs Excel Report

Title
1f96b24df4b06f5d68389845a9a13ed9-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found