Learning Macro-Actions in Reinforcement Learning
–Neural Information Processing Systems
We present a method for automatically constructing macro-actions from scratch from primitive actions during the reinforcement learning process. The overall idea is to reinforce the tendency to perform action b after action a if such a pattern of actions has been rewarded. We test the method on a bicycle task, the car-on-the-hill task, the racetrack task and some grid-world tasks. For the bicycle and racetrack tasks the use of macro-actions approximately halves the learning time, while for one of the grid-world tasks the learning time is reduced by a factor of 5. The method did not work for the car-on-the-hill task for reasons we discuss in the conclusion.
Neural Information Processing Systems
Dec-31-1999
- Country:
- Europe
- Denmark > Capital Region
- Copenhagen (0.05)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.05)
- Denmark > Capital Region
- North America > United States
- Massachusetts (0.05)
- Europe
- Genre:
- Research Report (0.46)
- Technology: