State-Aware Variational Thompson Sampling for Deep Q-Networks

Open in new window