Variance Reduction for Deep Q-Learning using Stochastic Recursive Gradient

Open in new window