Goto

Collaborating Authors

 Reinforcement Learning




Generalized Hindsight for Reinforcement Learning

Neural Information Processing Systems

Intuitively, given a behavior generated under one task, Generalized Hindsight returns a different task that the behavior is better suited for. Then, the behavior is relabeled with this new task before being used by an off-policy RL optimizer.


Towards Robust Bisimulation Metric Learning

Neural Information Processing Systems

Learned representations in deep reinforcement learning (DRL) have to extract task-relevant information from complex observations, balancing between robustness to distraction and informativeness to the policy.






Automatic Curriculum Learning through Value Disagreement

Neural Information Processing Systems

Continually solving new, unsolved tasks is the key to learning diverse behaviors. Through reinforcement learning (RL), we have made massive strides towards solving tasks that have a single goal.