Convergence of a Q-learning Variant for Continuous States and Actions
