Reward Is Not Enough for Risk-Averse Reinforcement Learning


TL;DR: Risk aversion is essential in many RL applications (e.g., driving, robotic surgery, and finance). Some modified RL frameworks account for risk (e.g., by optimizing a risk-measure of the return instead of its expectation), but they pose new algorithmic challenges. It is therefore often suggested to stick with the good old RL framework and simply redefine the rewards so that negative outcomes are amplified. Unfortunately, as discussed below, modeling risk as an expectation over redefined rewards is often unnatural, impractical, or even mathematically impossible, and hence cannot replace explicit optimization of risk-measures. This is consistent with similar results from decision theory, where risk optimization is not equivalent to expected-utility maximization.
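
To make the distinction concrete, here is a minimal sketch (not from the article) contrasting the risk-neutral objective with a common risk-measure, CVaR, on the same sampled episode returns. The return distribution, the alpha level, and the penalty factor are all hypothetical illustrations.

```python
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical episode returns from some policy: mostly fine, occasionally disastrous.
returns = np.concatenate([rng.normal(10.0, 1.0, 950),   # typical episodes
                          rng.normal(-50.0, 5.0, 50)])  # rare catastrophic episodes

def cvar(samples, alpha=0.05):
    """Conditional Value-at-Risk: mean of the worst alpha-fraction of outcomes."""
    cutoff = np.quantile(samples, alpha)
    return samples[samples <= cutoff].mean()

print("Expected return:    ", returns.mean())  # looks acceptable (~7)
print("CVaR_0.05 of return:", cvar(returns))   # exposes the catastrophic tail (~-50)

# Amplifying negative outcomes (here applied to the whole return for simplicity,
# rather than per-step rewards) changes the expectation, but it is still an
# expectation; in general it does not coincide with any CVaR of the original return.
penalty = 3.0
reshaped = np.where(returns < 0, penalty * returns, returns)
print("Expectation of reshaped returns:", reshaped.mean())
```

The point of the sketch is that the risk-neutral mean hides the rare catastrophic tail, the CVaR objective surfaces it, and a fixed reward-reshaping factor only rescales the expectation rather than reproducing the risk-measure.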
