TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models

Open in new window