Reinforcement Learning Applied to Linear Quadratic Regulation
Recent research on reinforcement learning has focused on algorithms based on the principles of Dynamic Programming (DP). One of the most promising areas of application for these algorithms is the control of dynamical systems, and some impressive results have been achieved. However, there are significant gaps between practice and theory. In particular, there are no convergence proofs for problems with continuous state and action spaces, or for systems involving nonlinear function approximators (such as multilayer perceptrons). This paper presents research applying DP-based reinforcement learning theory to Linear Quadratic Regulation (LQR), an important class of control problems involving continuous state and action spaces and requiring a simple type of nonlinear function approximator. We describe an algorithm based on Q-learning that is proven to converge to the optimal controller for a large class of LQR problems. We also describe a slightly different algorithm that is only locally convergent to the optimal Q-function, demonstrating one of the possible pitfalls of using a nonlinear function approximator with DP-based learning.
Neural Information Processing Systems
Dec-31-1993
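
To make the setting of the abstract concrete, the following is a minimal illustrative sketch of Q-function policy iteration for discrete-time LQR, not the paper's exact algorithm (which estimates the quadratic Q-function from observed transitions, e.g. by recursive least squares, rather than from a known model). The system matrices A, B, Q, R and all other numerical values are arbitrary placeholders chosen only to make the example runnable.

```python
import numpy as np

# Illustrative sketch (assumed setup, not taken from the paper):
#   dynamics  x_{t+1} = A x_t + B u_t
#   cost      c(x, u) = x'Qx + u'Ru
# For a fixed linear policy u = K x, the Q-function is quadratic,
#   Q_K(x, u) = [x; u]' H [x; u],
# and the greedy (improved) policy is K' = -inv(H_uu) H_ux.

A = np.array([[0.9, 0.1],
              [0.0, 0.9]])      # placeholder stable system
B = np.array([[0.0],
              [0.1]])
Q = np.eye(2)                   # placeholder state cost
R = np.array([[0.1]])           # placeholder control cost
n, m = B.shape

def q_matrix_for_policy(K, iters=500):
    """Evaluate the quadratic Q-function of the policy u = K x by
    iterating the Bellman equation for Q_K until it (approximately)
    converges; requires A + BK to be stable."""
    H = np.zeros((n + m, n + m))
    AB = np.hstack([A, B])
    cost = np.block([[Q, np.zeros((n, m))],
                     [np.zeros((m, n)), R]])
    for _ in range(iters):
        # Value matrix of the policy: P such that V_K(x) = x'Px,
        # obtained by plugging u = Kx into the current H.
        P = H[:n, :n] + H[:n, n:] @ K + K.T @ H[n:, :n] + K.T @ H[n:, n:] @ K
        H = cost + AB.T @ P @ AB
    return H

K = np.zeros((m, n))            # start from the zero (stabilizing) policy
for _ in range(10):             # policy iteration on Q-functions
    H = q_matrix_for_policy(K)
    K = -np.linalg.solve(H[n:, n:], H[n:, :n])   # greedy policy w.r.t. Q_K

print("approximately optimal gain K:\n", K)
```

Because the Q-function of every linear policy is exactly quadratic in (x, u), a quadratic parameterization is the "simple type of nonlinear function approximator" the abstract refers to; the learning version of the algorithm fits the matrix H from sampled (x, u, cost, x') data instead of computing it from A and B as above.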