Variational Distillation of Diffusion Policies into Mixture of Experts
Denis Blessing
Neural Information Processing Systems
This work introduces Variational Diffusion Distillation (VDD), a novel method that distills denoising diffusion policies into Mixtures of Experts (MoE) through variational inference. Diffusion models are the current state of the art in generative modeling because they can accurately learn and represent complex, multi-modal distributions. This ability allows them to replicate the inherent diversity of human behavior, making them the preferred models in behavior learning settings such as Learning from Human Demonstrations (LfD). However, diffusion models have drawbacks, including intractable likelihoods and long inference times caused by their iterative sampling process. The inference times in particular pose a significant challenge for real-time applications such as robot control. In contrast, MoEs address both issues while retaining the ability to represent complex distributions, but they are notoriously difficult to train.
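To make the contrast with iterative diffusion sampling concrete, the following is a minimal sketch of a state-conditioned Gaussian MoE policy with an exact log-likelihood and single-pass sampling. It is not the VDD method or the authors' architecture: the two linear "experts", the gating logits, and all function names (`log_gating`, `expert_params`, `moe_log_likelihood`, `moe_sample`) are hypothetical choices made only for illustration.

```python
import math
import random


def log_gating(state):
    """Hypothetical gating network: log-softmax over two linear logits."""
    logits = [1.0 * state, -1.0 * state]
    m = max(logits)
    lse = m + math.log(sum(math.exp(l - m) for l in logits))
    return [l - lse for l in logits]


def expert_params(state):
    """Hypothetical experts: each yields (mean, std) of a 1-D Gaussian over actions."""
    return [(state + 1.0, 0.1), (state - 1.0, 0.1)]


def log_normal(x, mean, std):
    """Log-density of a univariate Gaussian."""
    return -0.5 * math.log(2.0 * math.pi) - math.log(std) - 0.5 * ((x - mean) / std) ** 2


def moe_log_likelihood(action, state):
    """Exact log p(a|s) = log sum_z p(z|s) N(a; mu_z(s), sigma_z^2), via log-sum-exp."""
    terms = [
        lw + log_normal(action, mu, sd)
        for lw, (mu, sd) in zip(log_gating(state), expert_params(state))
    ]
    m = max(terms)
    return m + math.log(sum(math.exp(t - m) for t in terms))


def moe_sample(state, rng):
    """Single-pass sampling: choose an expert from the gating, then draw from its Gaussian."""
    weights = [math.exp(lw) for lw in log_gating(state)]
    z = rng.choices(range(len(weights)), weights=weights, k=1)[0]
    mu, sd = expert_params(state)[z]
    return rng.gauss(mu, sd)


if __name__ == "__main__":
    rng = random.Random(0)
    print("sampled action:", moe_sample(0.0, rng))
    print("log-likelihood of a=1.0 at s=0.0:", moe_log_likelihood(1.0, 0.0))
```

Both operations cost one evaluation of the gating and the experts, whereas a diffusion policy would need many denoising steps per action and offers no closed-form likelihood. The sketch says nothing about how such a mixture is trained, which is the difficulty the abstract points to.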
May-21-2025, 11:33:41 GMT
- Country:
- Europe > Germany > Baden-Württemberg (0.14)
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Education (0.46)
- Technology: