Integrated Modeling and Control Based on Reinforcement Learning and Dynamic Programming

Dec-31-1991–Neural Information Processing Systems

This is a summary of results with Dyna, a class of architectures for intelligent systems based on approximating dynamic programming methods. Dyna architectures integrate trial-and-error (reinforcement) learning and execution-time planning into a single process operating alternately on the world and on a learned forward model of the world. We describe and show results for two Dyna architectures, Dyna-AHC and Dyna-Q. Using a navigation task, results are shown for a simple Dyna-AHC system which simultaneously learns by trial and error, learns a world model, and plans optimal routes using the evolving world model. We show that Dyna-Q architectures (based on Watkins's Q-learning) are easy to adapt for use in changing environments.
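The alternation the abstract describes, acting in the world, updating a learned model, then planning against that model, can be illustrated with a minimal tabular sketch in the spirit of Dyna-Q. This is not the paper's implementation: the function names (`dyna_q`, `grid_step`), the open 5x5 grid, the reward scheme, the deterministic table model, and all hyperparameter values are assumptions chosen for illustration only.

```python
import random

# Hypothetical 5x5 open grid; states are numbered row by row.
WIDTH, HEIGHT = 5, 5
MOVES = [(0, -1), (0, 1), (-1, 0), (1, 0)]  # up, down, left, right
START, GOAL = 0, WIDTH * HEIGHT - 1


def grid_step(s, a):
    """Assumed world dynamics: move one cell, staying put at walls.

    Returns (reward, next_state); reward is 1.0 only on entering GOAL.
    """
    x, y = s % WIDTH, s // WIDTH
    dx, dy = MOVES[a]
    nx = min(max(x + dx, 0), WIDTH - 1)
    ny = min(max(y + dy, 0), HEIGHT - 1)
    s2 = ny * WIDTH + nx
    return (1.0 if s2 == GOAL else 0.0), s2


def dyna_q(step, n_states, n_actions, start, goal, episodes=30,
           planning_steps=10, alpha=0.5, gamma=0.95, epsilon=0.1,
           max_steps=10_000, seed=0):
    """Tabular Dyna-Q sketch: real experience and model-generated
    experience both feed the same one-step Q-learning update."""
    rng = random.Random(seed)
    q = [[0.0] * n_actions for _ in range(n_states)]
    model = {}  # learned forward model: (s, a) -> (reward, next_state)
    steps_per_episode = []

    def backup(s, a, r, s2):
        # One-step Q-learning update toward r + gamma * max_a' Q(s2, a').
        q[s][a] += alpha * (r + gamma * max(q[s2]) - q[s][a])

    for _ in range(episodes):
        s, steps = start, 0
        while s != goal and steps < max_steps:
            # Epsilon-greedy action choice with random tie-breaking.
            if rng.random() < epsilon:
                a = rng.randrange(n_actions)
            else:
                best = max(q[s])
                a = rng.choice([i for i, v in enumerate(q[s]) if v == best])
            r, s2 = step(s, a)          # act in the world
            backup(s, a, r, s2)         # learn by trial and error
            model[(s, a)] = (r, s2)     # learn the world model
            for _ in range(planning_steps):
                # Plan: replay a previously seen (s, a) through the model.
                (ps, pa), (pr, ps2) = rng.choice(list(model.items()))
                backup(ps, pa, pr, ps2)
            s, steps = s2, steps + 1
        steps_per_episode.append(steps)
    return q, steps_per_episode
```

Calling `dyna_q(grid_step, WIDTH * HEIGHT, len(MOVES), START, GOAL)` returns the action-value table and the number of steps taken in each episode. Setting `planning_steps=0` reduces the sketch to plain Q-learning, which makes the contribution of the planning loop easy to compare.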

architecture, evaluation function, world model

Country:
- North America > United States
  - Massachusetts > Middlesex County > Waltham (0.04)
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.04)
- Asia > Middle East
  - Iran (0.04)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Machine Learning > Reinforcement Learning (1.00)
