Generalization in Reinforcement Learning with Selective Noise Injection and Information Bottleneck

Oct-11-2024, 03:20:49 GMT–Neural Information Processing Systems

The ability of policies to generalize to new environments is key to the broad application of RL agents. A promising approach to prevent an agent's policy from overfitting to a limited set of training environments is to apply regularization techniques originally developed for supervised learning. However, there are stark differences between supervised learning and RL. We discuss those differences and propose modifications to existing regularization techniques in order to better adapt them to RL. In particular, we focus on regularization techniques that rely on injecting noise into the learned function, a family that includes some of the most widely used approaches, such as Dropout and Batch Normalization.

noise injection and information bottleneck, regularization technique, selective noise injection

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.79)