Practical Risk Measures in Reinforcement Learning
Di Castro, Dotan, Oren, Joel, Mannor, Shie
Practical application of Reinforcement Learning (RL) often involves risk considerations. We study a generalized approximation scheme for risk measures, based on Monte-Carlo simulations, where the risk measures need not necessarily be \emph{coherent}. We demonstrate that, even in simple problems, measures such as the variance of the reward-to-go do not capture the risk in a satisfactory manner. In addition, we show how a risk measure can be derived from model's realizations. We propose a neural architecture for estimating the risk and suggest the risk critic architecture that can be use to optimize a policy under general risk measures. We conclude our work with experiments that demonstrate the efficacy of our approach.
Aug-22-2019
- Country:
- Asia > Middle East
- Israel (0.14)
- North America > United States
- Massachusetts > Middlesex County (0.14)
- Asia > Middle East
- Genre:
- Research Report (1.00)
- Technology: