Q-Learning with Hidden-Unit Restarting
Neural Information Processing Systems
Platt's resource-allocation network (RAN) (Platt, 1991a, 1991b) is modified in two ways: it is adapted to a reinforcement-learning paradigm, and it "restarts" existing hidden units rather than adding new ones. After restarting, units continue to learn via back-propagation. The resulting restart algorithm is tested in a Q-learning network that learns to solve an inverted pendulum problem. Solutions are found faster on average with the restart algorithm than without it.
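The abstract does not give the restart rule itself, so the following is only a minimal sketch of what restarting a hidden unit might look like, not the paper's algorithm. It assumes Gaussian hidden units and borrows the RAN-style novelty test (large error and an input far from every unit center). The choice of which unit to restart (the one with the smallest output weight), the function name `maybe_restart`, and all threshold values are hypothetical and chosen for illustration. The back-propagation updates that follow a restart are omitted.

```python
import math


def maybe_restart(centers, widths, out_weights, x, td_error,
                  err_threshold=0.5, dist_threshold=1.0, overlap=0.8):
    """Restart one existing Gaussian unit if the input is novel and the error is large.

    All thresholds and the victim-selection rule are assumptions for this sketch.
    Returns the index of the restarted unit, or None if no restart happened.
    """
    # Euclidean distance from the input to every existing unit center.
    dists = [math.sqrt(sum((xi - ci) ** 2 for xi, ci in zip(x, c)))
             for c in centers]
    nearest = min(dists)

    # RAN-style novelty test: both the error and the distance must be large.
    if abs(td_error) <= err_threshold or nearest <= dist_threshold:
        return None

    # Assumed heuristic: the unit with the smallest |output weight| is least useful.
    victim = min(range(len(out_weights)), key=lambda i: abs(out_weights[i]))

    # Reuse that unit instead of allocating a new one: recenter it on the
    # current input, scale its width by the distance to the nearest center,
    # and set its output weight to the current error.
    centers[victim] = list(x)
    widths[victim] = overlap * nearest
    out_weights[victim] = td_error
    return victim


# Small usage example with two hidden units and a 2-D input.
centers = [[0.0, 0.0], [1.0, 1.0]]
widths = [0.5, 0.5]
out_weights = [0.9, 0.01]
restarted = maybe_restart(centers, widths, out_weights, [3.0, 3.0], td_error=1.2)
```

In this example the input lies far from both centers and the error exceeds the threshold, so the second unit (the one with the near-zero output weight) is recentered on the input. In a full Q-learning network, `td_error` would be the temporal-difference error for the chosen action, and ordinary gradient updates would resume on all units after the restart.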
Dec-31-1993