State Abstraction in MAXQ Hierarchical Reinforcement Learning

Dietterich, Thomas G.

Neural Information Processing Systems 

For example, in the Options framework [1, 2], the programmer defines a set of macro actions ("options") and provides a policy for each. Learning algorithms (such as semi-Markov Q-learning) can then treat these temporally abstract actions as if they were primitives and learn a policy for selecting among them. Closely related is the HAM framework, in which the programmer constructs a hierarchy of finite-state controllers [3]. Each controller can include non-deterministic states (where the programmer was not sure what action to perform). The HAMQ learning algorithm can then be applied to learn a policy for making choices in the non-deterministic states.
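As a minimal sketch of how a temporally abstract action can be treated like a primitive, the Python fragment below implements the semi-Markov Q-learning backup over options. The `Option` class, the `env.step` interface, and the function names are illustrative assumptions for this sketch, not part of the Options or HAM frameworks themselves.

```python
from collections import defaultdict

class Option:
    """Hypothetical macro action: a fixed policy plus a termination test."""
    def __init__(self, policy, is_terminal):
        self.policy = policy            # state -> primitive action
        self.is_terminal = is_terminal  # state -> bool

def run_option(env, s, option, gamma=0.9):
    """Execute the option's policy until it terminates.

    Returns the discounted reward G accumulated during execution,
    the number of primitive steps k taken, and the resulting state.
    Assumes an env.step(a) -> (next_state, reward, done) interface.
    """
    G, k = 0.0, 0
    while not option.is_terminal(s):
        a = option.policy(s)
        s, r, done = env.step(a)
        G += (gamma ** k) * r
        k += 1
        if done:
            break
    return G, k, s

def smdp_q_update(Q, s, o, G, k, s_next, options, alpha=0.1, gamma=0.9):
    """One semi-Markov Q-learning backup:

        Q(s, o) += alpha * (G + gamma**k * max_o' Q(s', o') - Q(s, o))
    """
    best_next = max(Q[(s_next, o2)] for o2 in options)
    Q[(s, o)] += alpha * (G + gamma ** k * best_next - Q[(s, o)])

# Q-values over (state, option) pairs, initialized to zero.
Q = defaultdict(float)
```

The `gamma ** k` factor is the key detail: it discounts the value of the resulting state by the option's actual duration, which is what allows a k-step macro action to be backed up exactly like a one-step primitive action in the standard Q-learning update.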
