Policy-shaped prediction: avoiding distractions in model-based reinforcement learning

Neural Information Processing Systems 

Model-based reinforcement learning (MBRL) is a promising route to sample-efficient policy optimization. However, a known vulnerability of reconstruction-based MBRL consists of scenarios in which detailed aspects of the world are highly predictable, but irrelevant to learning a good policy. Such scenarios can lead the model to exhaust its capacity on meaningless content, at the cost of neglecting important environment dynamics.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found