Actor-Critic Algorithms for Risk-Sensitive MDPs

Feb-14-2020, 14:11:03 GMT–Neural Information Processing Systems

In many sequential decision-making problems we may want to manage risk by minimizing some measure of variability in rewards in addition to maximizing a standard criterion. Variance related risk measures are among the most common risk-sensitive criteria in finance and operations research. However, optimizing many such criteria is known to be a hard problem. In this paper, we consider both discounted and average reward Markov decision processes. For each formulation, we first define a measure of variability for a policy, which in turn gives us a set of risk-sensitive criteria to optimize.

actor-critic algorithm, criteria, risk-sensitive mdp, (2 more...)

Neural Information Processing Systems

Feb-14-2020, 14:11:03 GMT

Conferences Web Page

Add feedback

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.90)