Finite-Time Analysis for Double Q-learning

Aug-16-2025, 05:37:12 GMT–Neural Information Processing Systems

The theoretical performance of Q-learning has also been studied intensively. Asymptotic convergence was established in Tsitsiklis (1994); Jaakkola et al. (1994); Borkar and Meyn (2000); Melo (2001); and Lee and He (2019).

double q-learning, q-learning, state-action pair

Country:
- North America
  - United States > Ohio (0.04)
  - Canada (0.04)
- Asia
  - Singapore (0.04)
  - Middle East > Jordan (0.04)
  - China > Guangdong Province
    - Shenzhen (0.04)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
