Efficient Nonlinear Control with Actor-Tutor Architecture

Doya, Kenji

Neural Information Processing Systems 

A new reinforcement learning architecture for nonlinear control is proposed. A direct feedback controller (the actor) is trained by a value-gradient-based controller (the tutor). This architecture enables both efficient use of the value function and simple computation for real-time implementation. Good performance was verified in multidimensional nonlinear control tasks using Gaussian softmax networks.
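To make the actor-tutor relationship concrete, the following is a minimal illustrative sketch, not the paper's implementation. It assumes a one-dimensional state, a hand-picked value function V(x) = -x^2, a unit input gain `b`, and a saturating tutor action u = u_max * tanh(b * dV/dx). All of these, along with the function names, are hypothetical choices made for illustration. The actor is a linear readout over normalized Gaussian features (a loose stand-in for the Gaussian softmax networks named in the abstract) and is trained by simple supervised regression toward the tutor's output.

```python
import numpy as np

def value_gradient(x):
    # Assumed value function V(x) = -x**2 (higher is better), so dV/dx = -2x.
    return -2.0 * x

def tutor(x, b=1.0, u_max=1.0):
    # Hypothetical value-gradient-based controller: act in the direction
    # that increases V, squashed to the range [-u_max, u_max].
    return u_max * np.tanh(b * value_gradient(x))

def features(x, centers, width=0.3):
    # Normalized Gaussian basis functions; the activations sum to 1.
    act = np.exp(-0.5 * ((x - centers) / width) ** 2)
    return act / act.sum()

def actor(x, w, centers):
    # Direct feedback controller: one weighted sum, no value gradient needed.
    return float(w @ features(x, centers))

def train_actor(n_steps=20000, lr=0.5, seed=0):
    # Supervised (delta-rule) training of the actor toward the tutor's action.
    rng = np.random.default_rng(seed)
    centers = np.linspace(-2.0, 2.0, 15)
    w = np.zeros_like(centers)
    for _ in range(n_steps):
        x = rng.uniform(-2.0, 2.0)
        phi = features(x, centers)
        error = tutor(x) - w @ phi
        w += lr * error * phi
    return w, centers

if __name__ == "__main__":
    w, centers = train_actor()
    for x in (-1.0, 0.0, 1.0):
        print(x, tutor(x), actor(x, w, centers))
```

Once trained, the actor can be evaluated with a single feature computation and dot product, which is the kind of simple real-time computation the abstract refers to, while the tutor still supplies the value-function information during learning.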
