Hybrid Reward Architecture for Reinforcement Learning Harm van Seijen

Neural Information Processing Systems 

One of the main challenges in reinforcement learning (RL) is generalisation. In typical deep RL methods this is achieved by approximating the optimal value function with a low-dimensional representation using a deep network.