A Convergent Form of Approximate Policy Iteration
