Q-Learning with Hidden-Unit Restarting

Dec-31-1993–Neural Information Processing Systems

Platt's resource-allocation network (RAN) (Platt, 1991a, 1991b) is modified for a reinforcement-learning paradigm and to "restart" existing hidden units rather than adding new units. After restarting, units continue to learn via back-propagation. The resulting restart algorithm is tested in a Q-Iearning network that learns to solve an inverted pendulum problem. Solutions are found faster on average with the restart algorithm than without it.

algorithm, pendulum, restart, (14 more...)

Neural Information Processing Systems

Dec-31-1993

Conferences PDF

Add feedback

Country:
- North America > United States
  - District of Columbia > Washington (0.04)
  - Massachusetts
    - Hampshire County > Amherst (0.14)
    - Middlesex County
      - Waltham (0.04)
      - Cambridge (0.04)
  - Colorado > Larimer County
    - Fort Collins (0.04)
  - California > San Mateo County
    - San Mateo (0.05)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Jordan (0.05)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Duplicate Docs Excel Report

Title
Q-Learning with Hidden-Unit Restarting
Q-Learning with Hidden-Unit Restarting

Similar Docs Excel Report more

Title	Similarity	Source
None found