Finite-Time Analysis for Double Q-learning
Neural Information Processing Systems
The theoretical performance of Q-learning has also been studied intensively. Its asymptotic convergence was established in Tsitsiklis (1994); Jaakkola et al. (1994); Borkar and Meyn (2000); Melo (2001); Lee and He (2019).
Aug-16-2025, 05:37:12 GMT
- Country:
  - Asia
    - China > Guangdong Province > Shenzhen (0.04)
    - Middle East > Jordan (0.04)
    - Singapore (0.04)
  - North America
    - Canada (0.04)
    - United States > Ohio (0.04)
- Technology: