Planning with an Adaptive World Model
Thrun, Sebastian, Möller, Knut, Linden, Alexander
–Neural Information Processing Systems
We present a new connectionist planning method [TML90]. By interaction with an unknown environment, a world model is progressively constructed usinggradient descent. For deriving optimal actions with respect to future reinforcement, planning is applied in two steps: an experience network proposesa plan which is subsequently optimized by gradient descent with a chain of world models, so that an optimal reinforcement may be obtained when it is actually run. The appropriateness of this method is demonstrated by a robotics application and a pole balancing task.
Neural Information Processing Systems
Dec-31-1991
- Country:
- North America > United States > Massachusetts > Hampshire County > Amherst (0.14)
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science > Problem Solving (0.85)
- Machine Learning (1.00)
- Representation & Reasoning (1.00)
- Robots (1.00)
- Information Technology > Artificial Intelligence