Learning to Predict Without Looking Ahead: World Models Without Forward Prediction
–Neural Information Processing Systems
In this work, we introduce a modification to traditional reinforcement learning which we call observational dropout, whereby we limit the agents ability to observe the real environment at each timestep.
Neural Information Processing Systems
Oct-2-2025, 05:11:48 GMT
- Country:
- Europe > Italy
- North America
- Canada (0.04)
- United States (0.04)
- Industry:
- Leisure & Entertainment (0.46)
- Technology: