Bisimulation metric for Model Predictive Control
Shimizu, Yutaka, Tomizuka, Masayoshi
–arXiv.org Artificial Intelligence
Model-based reinforcement learning has shown promise for improving sample efficiency and decision-making in complex environments. However, existing methods face challenges in training stability, robustness to noise, and computational efficiency. In this paper, we propose Bisimulation Metric for Model Predictive Control (BS-MPC), a novel approach that incorporates bisimulation metric loss in its objective function to directly optimize the encoder. This time-step-wise direct optimization enables the learned encoder to extract intrinsic information from the original state space while discarding irrelevant details and preventing the gradients and errors from diverging. BS-MPC improves training stability, robustness against input noise, and computational efficiency by reducing training time. We evaluate BS-MPC on both continuous control and image-based tasks from the DeepMind Control Suite, demonstrating superior performance and robustness compared to state-of-the-art baseline methods.
arXiv.org Artificial Intelligence
Oct-6-2024
- Country:
- North America > United States
- Virginia (0.14)
- California > Alameda County
- Berkeley (0.14)
- Europe
- North America > United States
- Genre:
- Overview > Innovation (0.34)
- Research Report
- New Finding (0.46)
- Promising Solution (0.34)
- Technology: