An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient Yudong Luo
–Neural Information Processing Systems
Restricting the variance of a policy's return is a popular choice in risk-averse Reinforcement Learning (RL) due to its clear mathematical definition and easy interpretability. Traditional methods directly restrict the total return variance. Recent methods restrict the per-step reward variance as a proxy.
Neural Information Processing Systems
Oct-9-2025, 06:23:02 GMT
- Country:
- Asia > China
- Guangdong Province > Shenzhen (0.04)
- Hong Kong (0.04)
- Europe > United Kingdom
- England > Oxfordshire > Oxford (0.04)
- North America > Canada
- Asia > China
- Industry:
- Government (0.46)
- Technology: