Goto

Collaborating Authors

 Reinforcement Learning









Replicable Reinforcement Learning

Neural Information Processing Systems

The replicability crisis in the social, behavioral, and data sciences has led to the formulation of algorithm frameworks for replicability -- i.e., a requirement that an algorithm produce identical outputs (with high probability) when run on two


Coherent Soft Imitation Learning Joe Watson Sandy H. Huang Nicolas Heess

Neural Information Processing Systems

Imitation learning methods seek to learn from an expert either through behavioral cloning (BC) for the policy or inverse reinforcement learning (IRL) for the reward. Such methods enable agents to learn complex tasks from humans that are difficult to capture with hand-designed reward functions.