Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling

Tengyang Xie, Yifei Ma, Yu-Xiang Wang

Neural Information Processing Systems 

Neural Information Processing Systems http://nips.cc/

Similar Docs  Excel Report  more

TitleSimilaritySource
None found