Two-way Deconfounder for Off-policy Evaluation in Causal Reinforcement Learning Shuguang Yu
–Neural Information Processing Systems
Before deploying any newly developed policy, it is important to assess its impact. In many high-stakes domains, it is risky or unethical to implement such policies directly for online evaluation.
Feb-16-2026, 14:16:06 GMT
- Country:
- Asia > China
- Europe > United Kingdom
- England > Greater London > London (0.04)
- North America > United States
- District of Columbia > Washington (0.04)
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Health & Medicine > Therapeutic Area
- Oncology (1.00)
- Information Technology (1.00)
- Technology: