Active Exploration in Dynamic Environments

Dec-31-1992–Neural Information Processing Systems

Many real-valued connectionist approaches to learning control realize exploration by randomness in action selection. This might be disadvantageous when costs are assigned to "negative experiences". The basic idea presented in this paper is to make an agent explore unknown regions in a more directed manner. This is achieved by a so-called competence map, which is trained to predict the controller's accuracy and is used for guiding exploration. Based on this, a bistable system enables smoothly switching attention between two behaviors - exploration and exploitation - depending on expected costs and knowledge gain. The appropriateness of this method is demonstrated by a simple robot navigation task.
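The abstract names two components, a competence map and a bistable switch between exploration and exploitation, without giving their equations. The following is only a minimal sketch under stated assumptions, not the paper's implementation: the competence map is replaced by a lookup table over discrete states (the abstract says it is trained, presumably as a network), and the bistable system is modeled as a sigmoid unit with positive self-feedback. All names (`CompetenceMap`, `update_gate`, `action_utility`) and the `feedback` and `rate` parameters are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


class CompetenceMap:
    """Tabular stand-in (assumption) for a trained competence map:
    one predicted controller error per discrete state."""

    def __init__(self, n_states, initial_error=1.0, rate=0.5):
        # Unvisited states start with a high predicted error.
        self.error = [initial_error] * n_states
        self.rate = rate

    def update(self, state, observed_error):
        # Move the prediction toward the error the controller actually made.
        self.error[state] += self.rate * (observed_error - self.error[state])

    def predict(self, state):
        return self.error[state]


def update_gate(gate, knowledge_gain, expected_cost, feedback=6.0):
    """One step of a bistable gate in [0, 1] (assumed form).

    Values near 1 favor exploration, values near 0 favor exploitation.
    The self-feedback term keeps the gate in its current mode unless the
    drive (knowledge gain minus expected cost) is large enough to flip it.
    """
    drive = knowledge_gain - expected_cost
    return sigmoid(feedback * (gate - 0.5) + drive)


def action_utility(gate, exploration_value, exploitation_value):
    # Smooth blend of the two behaviors, weighted by the gate.
    return gate * exploration_value + (1.0 - gate) * exploitation_value


if __name__ == "__main__":
    cmap = CompetenceMap(n_states=5)
    cmap.update(2, observed_error=0.0)  # the controller was accurate in state 2
    gate = 0.1                          # start in exploitation mode
    # Use the predicted error of an unvisited state as a proxy for knowledge gain.
    gate = update_gate(gate, knowledge_gain=5.0 * cmap.predict(0), expected_cost=0.0)
    print(gate, action_utility(gate, exploration_value=1.0, exploitation_value=0.2))
```

With zero drive the gate in this sketch settles at one of two stable values depending on where it starts, which is the bistable property; a sufficiently strong drive moves it to the other mode.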

artificial intelligence, exploration, neural network



Country:
- Europe > United Kingdom
  - England > Cambridgeshire > Cambridge (0.14)
- North America > United States
  - Pennsylvania > Allegheny County > Pittsburgh (0.14)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.97)
  - Robots (1.00)
