Model-Based Reinforcement Learning via Imagination with Derived Memory
–Neural Information Processing Systems
Model-based reinforcement learning aims to improve the sample efficiency of policy learning by modeling the dynamics of the environment. Recently, the latent dynamics model is further developed to enable fast planning in a compact space. It summarizes the high-dimensional experiences of an agent, which mimics the memory function of humans. Learning policies via imagination with the latent model shows great potential for solving complex tasks. However, only considering memories from the true experiences in the process of imagination could limit its advantages.
Neural Information Processing Systems
Oct-10-2024, 07:59:17 GMT
- Technology: