An Alternative to Variance: Gini Deviation for Risk-averse Policy Gradient Yudong Luo

Oct-9-2025, 06:23:02 GMT–Neural Information Processing Systems

Restricting the variance of a policy's return is a popular choice in risk-averse Reinforcement Learning (RL) due to its clear mathematical definition and easy interpretability. Traditional methods directly restrict the total return variance. Recent methods restrict the per-step reward variance as a proxy.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

Neural Information Processing Systems

Oct-9-2025, 06:23:02 GMT

Conferences PDF

Add feedback

Country:
- North America > Canada
  - Alberta (0.14)
  - Ontario (0.04)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.04)
- Asia > China
  - Hong Kong (0.04)
  - Guangdong Province > Shenzhen (0.04)

Industry:
- Government (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Optimization (0.92)
  - Machine Learning
    - Statistical Learning (0.92)
    - Reinforcement Learning (0.89)

Duplicate Docs Excel Report

Title
bf665e1cf271faa5037374c884ba3808-Paper-Conference.pdf

Similar Docs Excel Report more

Title	Similarity	Source
None found