Policy-shaped prediction: avoiding distractions in model-based reinforcement learning
–Neural Information Processing Systems
Model-based reinforcement learning (MBRL) is a promising route to sample-efficient policy optimization. However, a known vulnerability of reconstruction-based MBRL consists of scenarios in which detailed aspects of the world are highly predictable, but irrelevant to learning a good policy. Such scenarios can lead the model to exhaust its capacity on meaningless content, at the cost of neglecting important environment dynamics.
Neural Information Processing Systems
Nov-14-2025, 03:33:05 GMT
- Country:
- Europe > Italy
- Marche > Ancona Province > Ancona (0.04)
- North America > United States
- California > Santa Clara County > Palo Alto (0.04)
- Europe > Italy
- Genre:
- Research Report > Experimental Study (0.93)
- Industry:
- Information Technology (0.46)
- Leisure & Entertainment > Games (0.46)
- Technology: