On Gaussian approximation for entropy-regularized Q-learning with function approximation

Open in new window