Faster Non-asymptotic Convergence for Double Q-learning

Apr-25-2026, 12:42:34 GMT–Neural Information Processing Systems

Double Q-learning (Hasselt, 2010) has gained significant success in practice due to its effectiveness in overcoming the overestimation issue of Q-learning. However, the theoretical understanding of double Q-learning is rather limited. The only existing finite-time analysis was recently established in (Xiong et al., 2020), where the polynomial learning rate adopted in the analysis typically yields a slower convergence rate. This paper tackles the more challenging case of a constant learning rate, and develops new analytical tools that improve the existing convergence rate by orders of magnitude.

artificial intelligence, machine learning, reinforcement learning, (16 more...)

Neural Information Processing Systems

Apr-25-2026, 12:42:34 GMT

Conferences PDF

Add feedback

Genre:
- Research Report (0.47)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
3b712de48137572f3849aabd5666a4e3-Paper.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found