An Online Prediction Algorithm for Reinforcement Learning with Linear Function Approximation using Cross Entropy Method

Open in new window