On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations
