Future-Dependent Value-Based Off-Policy Evaluation in POMDPs
–Neural Information Processing Systems
Existing methods such as sequential importance sampling estimators suffer from the curse of horizon in POMDPs.
Neural Information Processing Systems
Oct-8-2025, 10:27:49 GMT
- Country:
- Asia > Japan
- Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- North America > United States
- Illinois (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Asia > Japan
- Genre:
- Instructional Material (0.46)
- Research Report (0.67)
- Technology: