Training Efficient Controllers via Analytic Policy Gradient

Open in new window