Efficient Nonlinear Control with Actor-Tutor Architecture
–Neural Information Processing Systems
A new reinforcement learning architecture for nonlinear control is proposed. A direct feedback controller, or the actor, is trained by a value-gradient based controller, or the tutor. This architecture enables both efficient use of the value function and simple computation for real-time implementation. Good performance was verified in multidimensional nonlinear control tasks using Gaussian softmax networks.
Neural Information Processing Systems
Dec-31-1997
- Country:
- North America > United States
- New York (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.05)
- Asia > Japan
- Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)
- North America > United States
- Industry:
- Health & Medicine (0.72)
- Technology: