Q-Learning with Hidden-Unit Restarting

Dec-31-1993–Neural Information Processing Systems

Platt's resource-allocation network (RAN) (Platt, 1991a, 1991b) is modified for a reinforcement-learning paradigm and to "restart" existing hidden units rather than adding new units. After restarting, unitscontinue to learn via back-propagation. The resulting restart algorithm is tested in a Q-Iearning network that learns to solve an inverted pendulum problem. Solutions are found faster on average with the restart algorithm than without it.

algorithm, artificial intelligence, reinforcement learning, (14 more...)

Neural Information Processing Systems

Dec-31-1993

Conferences PDF

Add feedback

Country:
- North America > United States > Massachusetts
  - Hampshire County > Amherst (0.14)
  - Middlesex County (0.14)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Q-Learning with Hidden-Unit Restarting
Q-Learning with Hidden-Unit Restarting

Similar Docs Excel Report more

Title	Similarity	Source
None found