ComprehensiveKnowledgeDistillation withCausalIntervention
–Neural Information Processing Systems
Although theteacher haslearned rich and powerful representations, it also contains unignorable bias knowledge which is usually induced by the context prior (e.g., background) in the training data.
Neural Information Processing Systems
Feb-10-2026, 21:47:14 GMT