Q-Learning with Hidden-Unit Restarting
Platt's resource-allocation network (RAN) (Platt, 1991a, 1991b) is modified for a reinforcement-learning paradigm and to "restart" existing hidden units rather than adding new units. After restarting, units continue to learn via back-propagation. The resulting restart algorithm is tested in a Q-learning network that learns to solve an inverted pendulum problem. Solutions are found faster on average with the restart algorithm than without it.
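
The sketch below illustrates the general idea described in the abstract: a single-hidden-layer Q-network trained by back-propagation on TD errors, where a "restart" re-initializes an existing hidden unit instead of growing the network. The abstract does not specify the restart criterion or the choice of which unit to restart, so the RAN-style novelty test (large TD error and no strongly responding unit), the "least useful unit" selection, and all parameter values here are illustrative assumptions, not the paper's method.

    # Minimal sketch, assuming: tanh hidden units, per-action TD/back-prop updates,
    # and a restart rule that re-initializes the least useful unit when the TD error
    # is large and no existing unit responds strongly (a RAN-style novelty test).
    import numpy as np

    class RestartQNet:
        def __init__(self, n_in, n_hidden, n_actions, lr=0.05, seed=0):
            rng = np.random.default_rng(seed)
            self.W1 = rng.normal(0.0, 0.1, (n_hidden, n_in))      # input -> hidden
            self.b1 = np.zeros(n_hidden)
            self.W2 = rng.normal(0.0, 0.1, (n_actions, n_hidden))  # hidden -> Q-values
            self.b2 = np.zeros(n_actions)
            self.lr = lr
            self.usage = np.zeros(n_hidden)  # running estimate of each unit's contribution

        def forward(self, x):
            h = np.tanh(self.W1 @ x + self.b1)
            q = self.W2 @ h + self.b2
            return h, q

        def td_update(self, x, a, target):
            """One back-propagation step on the TD error of the chosen action."""
            h, q = self.forward(x)
            err = target - q[a]
            dh = err * self.W2[a] * (1.0 - h ** 2)     # back-propagated error signal
            self.W2[a] += self.lr * err * h
            self.b2[a] += self.lr * err
            self.W1 += self.lr * np.outer(dh, x)
            self.b1 += self.lr * dh
            # Track each unit's average influence on the output (used to pick restarts).
            self.usage = 0.99 * self.usage + 0.01 * np.abs(h * self.W2[a])
            return err, h

        def maybe_restart(self, x, err, h, err_thresh=0.5, act_thresh=0.5):
            """Restart (re-initialize) the least useful hidden unit instead of adding
            a new one, when the error is large and no unit responds strongly."""
            if abs(err) > err_thresh and np.max(np.abs(h)) < act_thresh:
                j = int(np.argmin(self.usage))   # least useful unit (assumed criterion)
                self.W1[j] = x.copy()            # crude analogue of centering on the input
                self.b1[j] = 0.0
                self.W2[:, j] = 0.0              # restarted unit starts with no influence
                self.usage[j] = self.usage.mean()
                return j
            return None

    # Hypothetical usage with a 4-dimensional pendulum state and 2 actions:
    net = RestartQNet(n_in=4, n_hidden=10, n_actions=2)
    x, a, target = np.ones(4), 0, 1.0
    err, h = net.td_update(x, a, target)
    net.maybe_restart(x, err, h)

After a restart, the re-initialized unit is trained by the same back-propagation updates as every other unit, which is the key difference from RAN's strategy of allocating new units.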
Neural Information Processing Systems
Dec-31-1993