The Role of Lookahead and Approximate Policy Evaluation in Policy Iteration with Linear Value Function Approximation

Open in new window