Distributional Reward Decomposition for Reinforcement Learning
Zichuan Lin, Li Zhao, Derek Yang, Tao Qin, Tie-Yan Liu, Guangwen Yang
–Neural Information Processing Systems
Van Seijen et al. [2017] propose to split a state into different sub-states, each with a sub-reward obtained bytraining ageneral valuefunction, andlearnmultiple valuefunctions withsub-rewards. The architecture is rather limited due to requiring prior knowledge of how to split into sub-states.
Neural Information Processing Systems
Feb-13-2026, 01:16:51 GMT