Inverse Reinforcement Learning through Policy Gradient Minimization
