EBGAN-MDN: An Energy-Based Adversarial Framework for Multi-Modal Behavior Cloning
Li, Yixiao, Barth, Julia, Kiefer, Thomas, Fraij, Ahmad
–arXiv.org Artificial Intelligence
Multi-modal behavior cloning faces significant challenges due to mode averaging and mode collapse, where traditional models fail to capture diverse input-output mappings. This problem is critical in applications like robotics, where modeling multiple valid actions ensures both performance and safety. We propose EBGAN-MDN, a framework that integrates energy-based models, Mixture Density Networks (MDNs), and adversarial training. By leveraging a modified InfoNCE loss and an energy-enforced MDN loss, EBGAN-MDN effectively addresses these challenges. Experiments on synthetic and robotic benchmarks demonstrate superior performance, establishing EBGAN-MDN as a effective and efficient solution for multi-modal learning tasks.
arXiv.org Artificial Intelligence
Oct-10-2025
- Genre:
- Instructional Material > Course Syllabus & Notes (0.46)
- Research Report (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning > Neural Networks (1.00)
- Representation & Reasoning (1.00)
- Robots (1.00)
- Information Technology > Artificial Intelligence