Goto

Collaborating Authors

 Uncertainty



Coherent Soft Imitation Learning Joe Watson Sandy H. Huang Nicolas Heess

Neural Information Processing Systems

Imitation learning methods seek to learn from an expert either through behavioral cloning (BC) for the policy or inverse reinforcement learning (IRL) for the reward. Such methods enable agents to learn complex tasks from humans that are difficult to capture with hand-designed reward functions.






Deep Recurrent Optimal Stopping

Neural Information Processing Systems

Deep neural networks (DNNs) have recently emerged as a powerful paradigm for solving Markovian optimal stopping problems. However, a ready extension of DNN-based methods to non-Markovian settings requires significant state and parameter space expansion, manifesting the curse of dimensionality.