Convergence of Optimistic and Incremental Q-Learning

Open in new window