Hindsight Optimization for Hybrid State and Action MDPs

Raghavan, Aswin (Oregon State University) | Sanner, Scott (University of Toronto) | Khardon, Roni (Tufts University) | Tadepalli, Prasad (Oregon State University) | Fern, Alan (Oregon State University)

Feb-14-2017–AAAI Conferences

Hybrid (mixed discrete and continuous) state and action Markov Decision Processes (HSA-MDPs) provide an expressive formalism for modeling stochastic and concurrent sequential decision-making problems. Existing solvers for HSA-MDPs are either limited to very restricted transition distributions, require knowledge of domain-specific basis functions to achieve good approximations, or do not scale. We explore a domain-independent approach based on the framework of hindsight optimization (HOP) for HSA-MDPs, which uses an upper bound on the finite-horizon action values for action selection. Our main contribution is a linear time reduction to a Mixed Integer Linear Program (MILP) that encodes the HOP objective, when the dynamics are specified as location-scale probability distributions parametrized by Piecewise Linear (PWL) functions of states and actions. In addition, we show how to use the same machinery to select actions based on a lower-bound generated by straight line plans. Our empirical results show that the HSA-HOP approach effectively scales to high-dimensional problems and outperforms baselines that are capable of scaling to such large hybrid MDPs.

constraint, hsa-mdp, milp, (14 more...)

AAAI Conferences

Feb-14-2017

Conferences PDF

Add feedback

Country:
- North America
  - United States
    - Oregon > Benton County
      - Corvallis (0.04)
    - Massachusetts > Middlesex County
      - Medford (0.04)
  - Canada > Ontario
    - Toronto (0.14)
- Asia > Middle East
  - Jordan (0.04)

Genre:
- Research Report > New Finding (0.34)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.94)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.48)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found