Off-Policy Evaluation for Episodic Partially Observable Markov Decision Processes under Non-Parametric Models
Neural Information Processing Systems
We study the problem of off-policy evaluation (OPE) for episodic Partially Observable Markov Decision Processes (POMDPs) with continuous states.
Oct-1-2025, 21:07:12 GMT
- Country:
  - Europe > United Kingdom
    - England > Cambridgeshire > Cambridge (0.04)
  - North America > United States
    - California > Orange County > Irvine (0.04)
- Technology: