Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning