Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds

Dec-31-2012–Neural Information Processing Systems

The expected return is a widely used objective in decision making under uncertainty. Manyalgorithms, such as value iteration, have been proposed to optimize it. In risk-aware settings, however, the expected return is often not an appropriate objective to optimize. We propose a new optimization objective for risk-aware planning and show that it has desirable theoretical properties. We also draw connections topreviously proposed objectives for risk-aware planing: minmax, exponential utility,percentile and mean minus variance. Our method applies to an extended class of Markov decision processes: we allow costs to be stochastic as long as they are bounded. Additionally, we present an efficient algorithm for optimizing theproposed objective. Synthetic and real-world experiments illustrate the effectiveness of our method, at scale.

artificial intelligence, machine learning, optimization, (14 more...)

Neural Information Processing Systems

Dec-31-2012

Conferences PDF

Add feedback

Country:
- North America > United States > California (0.14)

Industry:
- Consumer Products & Services > Travel (1.00)
- Transportation
  - Passenger (1.00)
  - Air (1.00)
  - Infrastructure & Services > Airport (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning
    - Uncertainty (0.48)
    - Optimization (0.47)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.62)

Duplicate Docs Excel Report

Title
Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds
Risk Aversion in Markov Decision Processes via Near Optimal Chernoff Bounds
Risk Aversion in Markov Decision Processes via Near-Optimal Chernoff Bounds

Similar Docs Excel Report more

Title	Similarity	Source
None found