Stochastic Variance Reduction for Deep Q-learning
