Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning Shuguang Y u

Neural Information Processing Systems 

Before deploying any newly developed policy, it is important to assess its impact. In many high-stakes domains, it is risky or unethical to implement such policies directly for online evaluation.

Similar Docs  Excel Report  more

TitleSimilaritySource
None found