Goto

Collaborating Authors

 Reinforcement Learning



The MAGICAL Benchmark for Robust Imitation

Neural Information Processing Systems

The robot could learn from these demonstrations to complete the tasks autonomously. For IL algorithms to be useful, however, they must be able to learn how to perform tasks from few demonstrations. A domestic robot wouldn't be very helpful if it required thirty demonstrations before it figured out that you are deliberately washing your purple cravat





On the Statistical Efficiency of Reward-Free Exploration in Non-Linear RL

Neural Information Processing Systems

Our analyses indicate that the explorability or reachability assumptions, previously made for the latter two settings, are not necessary statistically for reward-free exploration.