On Pathologies in KL-Regularized Reinforcement Learning from Expert Demonstrations