PlanGAN: Model-based Planning With Sparse Rewards and Multiple Goals

Oct-10-2024, 08:26:44 GMT–Neural Information Processing Systems

Learning with sparse rewards remains a significant challenge in reinforcement learning (RL), especially when the aim is to train a policy capable of achieving multiple different goals. To date, the most successful approaches for dealing with multi-goal, sparse reward environments have been model-free RL algorithms. In this work we propose PlanGAN, a model-based algorithm specifically designed for solving multi-goal tasks in environments with sparse rewards. Our method builds on the fact that any trajectory of experience collected by an agent contains useful information about how to achieve the goals observed during that trajectory. We use this to train an ensemble of conditional generative models (GANs) to generate plausible trajectories that lead the agent from its current state towards a specified goal.

plangan, sparse reward and multiple goal, trajectory, (3 more...)

Neural Information Processing Systems

Oct-10-2024, 08:26:44 GMT

Conferences Web Page

Add feedback

Country:
- North America > United States > Montana (0.09)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.68)