Near Optimal Policy Optimization via REPS