Deep Belief Nets as Function Approximators for Reinforcement Learning

Abtahi, Farnaz (University of Arizona) | Fasel, Ian (University of Arizona)

Aug-8-2011–AAAI Conferences

We describe a continuous state/action reinforcement learning method which uses deep belief networks (DBNs) in conjunction with a value function-based reinforcement learning algorithm to learn effective control policies. Our approach is to first learn a model of the state-action space from data in an unsupervised pre-training phase, and then use neural-fitted Q-iteration (NFQ) to learn an accurate value function approximator (analogous to a "fine-tuning" phase when training DBNs for classification). Our experiments suggest that this approach has the potential to significantly increase the efficiency of the learning process in NFQ, provided care is taken to ensure the initial data covers interesting areas of the state-action space, and may be particularly useful in transfer learning settings.
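The second phase described above builds on fitted Q-iteration, which repeatedly regresses a Q-function onto bootstrapped targets computed from a fixed batch of transitions. The following is a minimal sketch of that loop only, under strong simplifying assumptions that are not taken from the paper: a hypothetical 1-D task, two discrete actions instead of continuous ones, and a least-squares fit on hand-picked quadratic features standing in for the DBN-initialized neural network. The unsupervised pre-training phase is not shown, and all names here are illustrative.

```python
import random

GAMMA = 0.9            # discount factor (assumed value)
ACTIONS = (-0.1, 0.1)  # two discrete moves; the paper treats continuous actions


def features(s, a):
    # Hand-picked features; a stand-in for the network's learned representation.
    return [1.0, s * s, s * a]


def q_value(w, s, a):
    return sum(wi * fi for wi, fi in zip(w, features(s, a)))


def step(s, a):
    # Hypothetical toy dynamics: move along a line, reward is highest at the origin.
    s_next = max(-1.0, min(1.0, s + a))
    return s_next, -s_next * s_next


def collect_transitions(n, rng):
    # Random exploration that covers the whole state range.
    data = []
    for _ in range(n):
        s = rng.uniform(-1.0, 1.0)
        a = rng.choice(ACTIONS)
        s_next, r = step(s, a)
        data.append((s, a, r, s_next))
    return data


def solve(A, b):
    # Gaussian elimination with partial pivoting for a small dense system.
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda k: abs(M[k][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for k in range(col + 1, n):
            f = M[k][col] / M[col][col]
            for c in range(col, n + 1):
                M[k][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - tail) / M[i][i]
    return x


def fit_least_squares(X, y):
    # Solve the normal equations (X^T X) w = X^T y.
    n = len(X[0])
    A = [[sum(row[i] * row[j] for row in X) for j in range(n)] for i in range(n)]
    b = [sum(row[i] * t for row, t in zip(X, y)) for i in range(n)]
    return solve(A, b)


def fitted_q_iteration(data, n_iters=20):
    w = [0.0, 0.0, 0.0]
    X = [features(s, a) for s, a, r, s2 in data]
    for _ in range(n_iters):
        # Bootstrapped targets from the current Q estimate, then a full batch refit.
        targets = [r + GAMMA * max(q_value(w, s2, a2) for a2 in ACTIONS)
                   for s, a, r, s2 in data]
        w = fit_least_squares(X, targets)
    return w


def greedy_action(w, s):
    return max(ACTIONS, key=lambda a: q_value(w, s, a))


data = collect_transitions(500, random.Random(0))
weights = fitted_q_iteration(data)
```

In NFQ the regressor is a multilayer perceptron retrained on the whole batch at each iteration; in the approach summarized above, its initial weights would presumably come from the pre-trained DBN rather than from a zero or random start. The uniform sampling in `collect_transitions` reflects the abstract's caveat that the initial data should cover interesting areas of the state-action space.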

artificial intelligence, machine learning, reinforcement learning

Country:
- North America > United States > Arizona > Pima County > Tucson (0.14)

Genre:
- Research Report > Experimental Study (0.34)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (1.00)
