Off-policy Policy Evaluation For Sequential Decisions Under Unobserved Confounding
–Neural Information Processing Systems
In order to make counterfactual evaluations possible, a standard assumption--albeit often overlooked and unstated--is to require that the behavior policy does not depend on any unobserved variables that also affect the future states/rewards (no unobserved confounding).
Neural Information Processing Systems
Nov-20-2025, 09:46:50 GMT
- Country:
- Europe > United Kingdom
- England > Cambridgeshire > Cambridge (0.04)
- North America
- Canada (0.04)
- United States
- California
- Los Angeles County > Long Beach (0.04)
- Santa Clara County > Palo Alto (0.04)
- Florida > Palm Beach County
- Boca Raton (0.04)
- California
- Europe > United Kingdom
- Genre:
- Research Report
- Experimental Study (0.68)
- New Finding (0.67)
- Research Report
- Industry:
- Technology: