Automatic Data Augmentation for Generalization in Reinforcement Learning
Neural Information Processing Systems
Deep reinforcement learning (RL) agents often fail to generalize beyond their training environments. To alleviate this problem, recent work has proposed the use of data augmentation. However, different tasks tend to benefit from different types of augmentations and selecting the right one typically requires expert knowledge. In this paper, we introduce three approaches for automatically finding an effective augmentation for any RL task. These are combined with two novel regularization terms for the policy and value function, required to make the use of data augmentation theoretically sound for actor-critic algorithms.
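The abstract does not spell out the form of the two regularization terms. As a rough sketch only, one plausible reading is a consistency penalty: the policy and the value estimate should change little when an observation is augmented. The Python below assumes a discrete action space, a KL divergence for the policy term, and a squared difference for the value term. The function names and the weight `alpha` are hypothetical and are not taken from the paper.

```python
import math

def log_softmax(logits):
    """Numerically stable log-softmax over a list of action logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_z for x in logits]

def policy_consistency(logits_orig, logits_aug):
    """Assumed policy term: KL between the action distribution on the
    original observation and on its augmented copy."""
    log_p = log_softmax(logits_orig)
    log_q = log_softmax(logits_aug)
    return sum(math.exp(lp) * (lp - lq) for lp, lq in zip(log_p, log_q))

def value_consistency(v_orig, v_aug):
    """Assumed value term: squared gap between the two value estimates."""
    return (v_orig - v_aug) ** 2

def regularized_loss(base_loss, logits_orig, logits_aug, v_orig, v_aug, alpha=0.1):
    """Add both penalties to an actor-critic loss that is minimized.
    alpha is a hypothetical weighting coefficient."""
    penalty = policy_consistency(logits_orig, logits_aug) + value_consistency(v_orig, v_aug)
    return base_loss + alpha * penalty
```

Under these assumptions, both penalties vanish when the augmented observation yields the same policy logits and value estimate as the original, so the base actor-critic loss is recovered unchanged.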
May-20-2025, 19:41:03 GMT