Goto

Collaborating Authors

 Reinforcement Learning








Asymptotically Exact Error Characterization of Offline Policy Evaluation with Misspecified Linear Models

Neural Information Processing Systems

Recently, theoretical understanding of OPE has been rapidly advanced under (approximate) realizability assumptions, i.e., where the environments of interest are well approximated with the given hypothetical models.