Truncating Trajectories in Monte Carlo Policy Evaluation: an Adaptive Approach
