On the Curses of Future and History in Future-dependent Value Functions for OPE
–Neural Information Processing Systems
We study off-policy evaluation (OPE) in partially observable environments with complex observations, with the goal of developing estimators whose guarantee avoids exponential dependence on the horizon.
Neural Information Processing Systems
Feb-18-2026, 10:43:15 GMT
- Country:
- North America > United States
- Illinois > Champaign County
- Urbana (0.04)
- Indiana > Tippecanoe County
- Lafayette (0.04)
- Illinois > Champaign County
- North America > United States
- Genre:
- Research Report > Experimental Study (0.92)
- Workflow (0.68)