KD-Zero: Evolving Knowledge Distiller for Any Teacher-Student Pairs
–Neural Information Processing Systems
Knowledge distillation (KD) has emerged as an effective technique for compressing models that can enhance the lightweight model. Conventional KD methods propose various designs to allow student model to imitate the teacher better.
Neural Information Processing Systems
Oct-9-2025, 09:09:50 GMT
- Country:
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- Genre:
- Research Report (0.93)
- Industry:
- Education (0.88)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (0.71)
- Machine Learning
- Evolutionary Systems (0.68)
- Neural Networks > Deep Learning (0.46)
- Natural Language (0.67)
- Representation & Reasoning > Search (0.71)
- Vision (1.00)
- Information Technology > Artificial Intelligence