The Asymptotic Convergence-Rate of Q-learning

Szepesvári, Csaba

Neural Information Processing Systems 

R Pmin/Pmax is the ratio of the minimum and maximum state-action occupation frequencies.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found