Learning Reward Machines for Partially Observable Reinforcement Learning

Neural Information Processing Systems 

The use of neural networks for function approximation has led to many recent advances in Reinforcement Learning (RL).