On Dynamic Programming Decompositions of Static Risk Measures in Markov Decision Processes
–Neural Information Processing Systems
Risk-averse reinforcement learning (RL) seeks to provide a risk-averse policy for high-stakes real-world decision problems. These high-stake domains include autonomous driving (Jin et al., 2019; Sharma et al., 2020), robot collision avoidance (Ahmadi et al., 2021; Hakobyan and Y ang, 2021),
Neural Information Processing Systems
Oct-9-2025, 03:25:22 GMT
- Country:
- Asia
- China (0.04)
- Middle East > Republic of Türkiye (0.04)
- Europe > Switzerland
- Basel-City > Basel (0.04)
- North America
- Canada > Quebec
- Montreal (0.04)
- United States
- California > Santa Clara County
- Palo Alto (0.04)
- New Hampshire (0.04)
- California > Santa Clara County
- Canada > Quebec
- Asia
- Genre:
- Research Report (0.46)
- Industry:
- Health & Medicine > Therapeutic Area
- Immunology (0.46)
- Information Technology (0.87)
- Transportation (0.54)
- Health & Medicine > Therapeutic Area
- Technology: