Faster Non-asymptotic Convergence for Double Q-learning

Open in new window