Provably Efficient Neural GTD Algorithm for Off-policy Learning

Neural Information Processing Systems 

The resultant formulation can then be tackled using a neural GTD algorithm.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found