A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise

Open in new window