Posterior Sampling for Large Scale Reinforcement Learning