On Evaluating Policies for Robust POMDPs
–Neural Information Processing Systems
Robust partially observable Markov decision processes (RPOMDPs) model sequential decision-making problems under partial observability, where an agent must be robust against a range of dynamics. RPOMDPs can be viewed as a two-player game between an agent, who selects actions, and nature, who adversarially selects the dynamics. Evaluating an agent policy requires finding an adversarial nature policy, which is computationally challenging. In this paper, we advance the evaluation of agent policies for RPOMDPs in three ways. First, we discuss suitable benchmarks.
Neural Information Processing Systems
Jun-17-2026, 07:36:55 GMT
- Country:
- Europe > Netherlands (0.28)
- Genre:
- Research Report
- Experimental Study (0.67)
- New Finding (0.67)
- Research Report
- Industry:
- Health & Medicine (0.67)
- Technology: