Reliable Off-Policy Learning for Dosage Combinations

Neural Information Processing Systems

Existing work for this task has modeled the effects of multiple treatments independently, whereas estimating their joint effect has received little attention and poses non-trivial challenges. In this paper, we propose a novel method for reliable off-policy learning for dosage combinations.