Robust and Adaptive Planning under Model Uncertainty
Apoorva Sharma, James Harrison, Matthew Tsao, Marco Pavone
arXiv.org Artificial Intelligence
Planning under model uncertainty is a fundamental problem across many applications of decision making and learning. In this paper, we propose the Robust Adaptive Monte Carlo Planning (RAMCP) algorithm, which allows computation of risk-sensitive Bayes-adaptive policies that optimally trade off exploration, exploitation, and robustness. RAMCP formulates the risk-sensitive planning problem as a two-player zero-sum game, in which an adversary perturbs the agent's belief over the models. We introduce two versions of the RAMCP algorithm. The first, RAMCP-F, converges to an optimal risk-sensitive policy without having to rebuild the search tree as the underlying belief over models is perturbed. The second version, RAMCP-I, improves computational efficiency at the cost of losing theoretical guarantees, but is shown to yield empirical results comparable to RAMCP-F. RAMCP is demonstrated on an n-pull multi-armed bandit problem, as well as a patient treatment scenario.
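The game between agent and adversary described above can be illustrated with a small sketch. The abstract does not say how the adversary's perturbation is constrained, so this example assumes one common choice: the adversary may reweight the belief only up to a bounded likelihood ratio, a CVaR-style set with level `alpha`. It also assumes a finite set of candidate models and that the expected return of each action under each model is already known. The function names and the `alpha` parameter are hypothetical, and this is not the RAMCP tree search itself.

```python
def adversarial_belief(belief, values, alpha):
    """Return the perturbed belief q that minimises sum(q[i] * values[i])
    subject to 0 <= q[i] <= belief[i] / alpha and sum(q) == 1.

    Assumption: the bounded likelihood-ratio set is our choice for this
    sketch; the abstract does not specify the adversary's constraint.
    """
    if not 0.0 < alpha <= 1.0:
        raise ValueError("alpha must lie in (0, 1]")
    # The minimiser piles as much mass as allowed onto the worst models first.
    order = sorted(range(len(values)), key=lambda i: values[i])
    q = [0.0] * len(belief)
    remaining = 1.0
    for i in order:
        mass = min(belief[i] / alpha, remaining)
        q[i] = mass
        remaining -= mass
        if remaining <= 0.0:
            break
    return q


def robust_value(belief, values, alpha):
    """Expected return of one action under the adversarially perturbed belief."""
    q = adversarial_belief(belief, values, alpha)
    return sum(qi * vi for qi, vi in zip(q, values))


def robust_action(belief, values_by_action, alpha):
    """Agent's move in the zero-sum game: maximise the worst-case value."""
    return max(values_by_action,
               key=lambda a: robust_value(belief, values_by_action[a], alpha))


# Two candidate models, two actions (hypothetical numbers).
belief = [0.5, 0.5]
values_by_action = {"safe": [2.0, 2.0], "risky": [0.0, 5.0]}
risk_neutral_choice = robust_action(belief, values_by_action, alpha=1.0)
risk_averse_choice = robust_action(belief, values_by_action, alpha=0.5)
```

With `alpha=1.0` the adversary cannot move the belief, so the agent simply maximises expected return. Shrinking `alpha` gives the adversary more freedom and pushes the agent toward actions whose return is acceptable under every plausible model.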
Jan-8-2019
- Country:
- North America > United States > California > Santa Clara County (0.14)
- Genre:
- Research Report (0.50)
- Industry:
- Energy > Oil & Gas
- Upstream (0.34)
- Health & Medicine (1.00)
- Technology: