Q-Learning with Hidden-Unit Restarting
Neural Information Processing Systems
Platt's resource-allocation network (RAN) (Platt, 1991a, 1991b) is modified for a reinforcement-learning paradigm and to "restart" existing hidden units rather than add new ones. After restarting, units continue to learn via back-propagation. The resulting restart algorithm is tested in a Q-learning network that learns to solve an inverted pendulum problem. On average, solutions are found faster with the restart algorithm than without it.
Dec-31-1993
- Country:
- North America > United States > Massachusetts
- Hampshire County > Amherst (0.14)
- Middlesex County (0.14)