Robust and Adaptive Planning under Model Uncertainty

Apoorva Sharma, James Harrison, Matthew Tsao, Marco Pavone

Jan-8-2019, arXiv.org Artificial Intelligence

Planning under model uncertainty is a fundamental problem across many applications of decision making and learning. In this paper, we propose the Robust Adaptive Monte Carlo Planning (RAMCP) algorithm, which allows computation of risk-sensitive Bayes-adaptive policies that optimally trade off exploration, exploitation, and robustness. RAMCP formulates the risk-sensitive planning problem as a two-player zero-sum game, in which an adversary perturbs the agent's belief over the models. We introduce two versions of the RAMCP algorithm. The first, RAMCP-F, converges to an optimal risk-sensitive policy without having to rebuild the search tree as the underlying belief over models is perturbed. The second version, RAMCP-I, improves computational efficiency at the cost of losing theoretical guarantees, but is shown to yield empirical results comparable to RAMCP-F. RAMCP is demonstrated on an n-pull multi-armed bandit problem, as well as a patient treatment scenario.
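The zero-sum game between the agent and a belief-perturbing adversary can be illustrated on a one-shot toy problem. The following is a minimal sketch and not the RAMCP algorithm itself: it has no search tree and no Bayes-adaptive belief updates. It also assumes a CVaR-style constraint on the adversary (each model's probability may be scaled up by at most 1/alpha), which the abstract does not specify. All function names and the payoff numbers are hypothetical.

```python
def adversarial_belief(belief, values, alpha):
    """Worst-case reweighting q of `belief` for one fixed policy.

    Assumed constraint set (not taken from the abstract):
    0 <= q[i] <= belief[i] / alpha and sum(q) == 1, with 0 < alpha <= 1.
    The adversary minimizes expected value, so it greedily moves as much
    probability mass as allowed onto the lowest-value models.
    """
    order = sorted(range(len(belief)), key=lambda i: values[i])
    q = [0.0] * len(belief)
    remaining = 1.0
    for i in order:
        mass = min(belief[i] / alpha, remaining)
        q[i] = mass
        remaining -= mass
        if remaining <= 0.0:
            break
    return q


def robust_value(belief, values, alpha):
    """Expected value of a policy under the adversary's perturbed belief."""
    q = adversarial_belief(belief, values, alpha)
    return sum(qi * vi for qi, vi in zip(q, values))


def best_robust_policy(belief, payoff, alpha):
    """Agent's move: pick the policy with the highest worst-case value.

    payoff[p][m] is the (hypothetical) return of policy p under model m.
    """
    scores = [robust_value(belief, row, alpha) for row in payoff]
    best = max(range(len(payoff)), key=lambda p: scores[p])
    return best, scores


if __name__ == "__main__":
    belief = [0.5, 0.5]          # prior over two candidate models
    payoff = [[10.0, 0.0],       # policy 0: high return, but only under model 0
              [4.0, 4.0]]        # policy 1: modest return under both models
    print(best_robust_policy(belief, payoff, alpha=1.0))
    print(best_robust_policy(belief, payoff, alpha=0.5))
```

With alpha = 1 the adversary cannot move the belief at all, so the agent is risk-neutral and prefers the high-variance policy. As alpha shrinks, the adversary can concentrate mass on the unfavorable model and the agent shifts to the policy that performs acceptably under every model.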
