Finite Sample Analysis of the GTD Policy Evaluation Algorithms in Markov Setting

Open in new window