Beyond the Return: Off-policy Function Estimation under User-specified Error-measuring Distributions

Neural Information Processing Systems 

Theorem 6. Suppose Assumptions 4, 5, and 6 hold.
