Distributional Reward Decomposition for Reinforcement Learning

Zichuan Lin, Li Zhao, Derek Yang, Tao Qin, Tie-Yan Liu, Guangwen Yang

Neural Information Processing Systems 

Reinforcement learning has achieved great success in decision making problems since Deep Q-learning was proposed by Mnih et al. [2015].

Similar Docs  Excel Report  more

TitleSimilaritySource
None found