Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning
