Posterior Sampling with Delayed Feedback for Reinforcement Learning with Linear Function Approximation
