Recursive Reinforcement Learning

Name

Neural Information Processing Systems 

These RL algorithms are designed with a flat Markovian view of the environment in the form of a "state, action, reward, and next state" interface [