A Non-asymptotic Analysis of Non-parametric Temporal-Difference Learning

Neural Information Processing Systems 

Temporal-difference learning is a popular algorithm for policy evaluation.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found