Distributional Reinforcement Learning for Risk-Sensitive Policies

Neural Information Processing Systems 

On both synthetic and real data, we empirically show that our proposed algorithm is able to learn better CVaR-optimized policies.
