Multi-time Models for Temporally Abstract Planning

Precup, Doina, Sutton, Richard S.

Neural Information Processing Systems 

The natural abstract actions are to move from room to room.

1 Reinforcement Learning (MDP) Framework

In reinforcement learning, a learning agent interacts with an environment at some discrete, lowest-level time scale t = 0, 1, 2, ... On each time step, the agent perceives the state of the environment, s_t, and on that basis chooses a primitive action, a_t.
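
The interaction loop described above can be illustrated with a minimal sketch. Everything named here is an assumption made for illustration and does not come from the paper: the `TwoRoomEnv` class, its six-cell layout, the left/right primitive actions, and the uniformly random policy. Rewards are omitted because the excerpt does not describe them.

```python
import random


class TwoRoomEnv:
    """Hypothetical toy environment: cells 0..5 in a row.

    Cells 0-2 stand for one room and cells 3-5 for another (an assumption
    for illustration only).
    """

    def __init__(self, n_cells=6):
        self.n_cells = n_cells
        self.state = 0

    def reset(self):
        # Start every episode in the leftmost cell.
        self.state = 0
        return self.state

    def step(self, action):
        # A primitive action is -1 (left) or +1 (right); walls clip the move.
        self.state = min(max(self.state + action, 0), self.n_cells - 1)
        return self.state


def random_policy(state, rng):
    # Chooses a primitive action; this policy ignores the perceived state.
    return rng.choice([-1, 1])


def run_episode(env, policy, n_steps, seed=0):
    """Run the loop for t = 0, 1, 2, ... and record (t, s_t, a_t)."""
    rng = random.Random(seed)
    s = env.reset()
    trajectory = []
    for t in range(n_steps):
        a = policy(s, rng)           # agent perceives s_t and chooses a_t
        trajectory.append((t, s, a))
        s = env.step(a)              # environment moves to the next state
    return trajectory


if __name__ == "__main__":
    for t, s, a in run_episode(TwoRoomEnv(), random_policy, n_steps=5):
        print(f"t={t} s_t={s} a_t={a:+d}")
```

In this sketch an abstract action such as "move to the other room" would correspond to a whole sequence of these primitive steps, which is the kind of temporal abstraction the title refers to.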
