Bisimulation metric for Model Predictive Control

Oct-6-2024–arXiv.org Artificial Intelligence

Model-based reinforcement learning has shown promise for improving sample efficiency and decision-making in complex environments. However, existing methods face challenges in training stability, robustness to noise, and computational efficiency. In this paper, we propose Bisimulation Metric for Model Predictive Control (BS-MPC), a novel approach that incorporates bisimulation metric loss in its objective function to directly optimize the encoder. This time-step-wise direct optimization enables the learned encoder to extract intrinsic information from the original state space while discarding irrelevant details and preventing the gradients and errors from diverging. BS-MPC improves training stability, robustness against input noise, and computational efficiency by reducing training time. We evaluate BS-MPC on both continuous control and image-based tasks from the DeepMind Control Suite, demonstrating superior performance and robustness compared to state-of-the-art baseline methods.

machine learning, reinforcement learning, td-mpc, (18 more...)

arXiv.org Artificial Intelligence

Oct-6-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States
  - Virginia (0.14)
  - California > Alameda County
    - Berkeley (0.14)
- Europe
  - Sweden (0.14)
  - France (0.14)

Genre:
- Overview > Innovation (0.34)
- Research Report
  - New Finding (0.46)
  - Promising Solution (0.34)

Industry:
- Energy > Oil & Gas > Upstream (0.61)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning
    - Reinforcement Learning (1.00)
    - Neural Networks > Deep Learning (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found