Statistical Inference of the Value Function for Reinforcement Learning in Infinite Horizon Settings

Open in new window