Relative Entropy Regularized Policy Iteration
