Multi-time Models for Temporally Abstract Planning
Precup, Doina, Sutton, Richard S.
–Neural Information Processing Systems
The Natural abstract actions are to move from room to room. 1 Reinforcement Learning (MDP) Framework In reinforcement learning, a learning agent interacts with an environment at some discrete, lowest-level time scale t 0,1,2, ... On each time step, the agent perceives the state of the environment, St, and on that basis chooses a primitive action, at.
Neural Information Processing Systems
Dec-31-1998
- Country:
- North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
- Technology: