Hybrid Reward Architecture for Reinforcement Learning Harm van Seijen