Generalized Hindsight for Reinforcement Learning

Neural Information Processing Systems 

Intuitively, given a behavior generated under one task, Generalized Hindsight returns a different task that the behavior is better suited for. Then, the behavior is relabeled with this new task before being used by an off-policy RL optimizer.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found