Learning Spatio-Temporal Planning from a Dynamic Programming Teacher: Feed-Forward Neurocontrol for Moving Obstacle Avoidance
Fahner, Gerald, Eckmiller, Rolf
–Neural Information Processing Systems
The action network is embedded in a sensorymotoric systemarchitecture that contains a separate world model. It is continuously fed with short-term predicted spatiotemporal obstacle trajectories, and receives robot state feedback. The action netallows for external switching between alternative planning tasks.It generates goal-directed motor actions - subject to the robot's kinematic and dynamic constraints - such that collisions withmoving obstacles are avoided. Using supervised learning, we distribute examples of the optimal planner mapping over a structure-level adapted parsimonious higher order network. The training database is generated by a Dynamic Programming algorithm. Extensivesimulations reveal, that the local planner mapping is highly nonlinear, but can be effectively and sparsely represented bythe chosen powerful net model. Excellent generalization occurs for unseen obstacle configurations. We also discuss the limitations offeed-forward neurocontrol for growing planning horizons.
Neural Information Processing Systems
Dec-31-1993
- Country:
- Europe
- Germany (0.05)
- United Kingdom > England
- East Sussex > Brighton (0.04)
- North America > United States
- Colorado > Denver County
- Denver (0.04)
- New York (0.05)
- Colorado > Denver County
- Europe
- Technology: