Near Optimal Policy Optimization via REPS

Open in new window