Decoupled Kullback-Leibler Divergence Loss
Neural Information Processing Systems
Firstly, we address the limitation of KL/DKL in scenarios like knowledge distillation by breaking its asymmetric optimization property.
Oct-10-2025, 08:39:03 GMT
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology > Security & Privacy (0.46)
- Technology: