Learning Factored Markov Decision Processes with Unawareness

Feb-27-2019–arXiv.org Artificial Intelligence

Methods for learning and planning in sequential decision problems often assume the learner is aware of all possible states and actions in advance. This assumption is sometimes untenable. In this paper, we give a method to learn factored markov decision problems from both domain exploration and expert assistance, which guarantees convergence to near-optimal behaviour, even when the agent begins unaware of factors critical to success. Our experiments show our agent learns optimal behaviour on small and large problems, and that conserving information on discovering new possibilities results in faster convergence.

agent, artificial intelligence, machine learning, (13 more...)

arXiv.org Artificial Intelligence

Feb-27-2019

arXiv.org PDF

Add feedback

Country:
- Europe > United Kingdom
  - Scotland > City of Edinburgh > Edinburgh (0.04)
- Asia > Japan
  - Honshū > Chūbu > Ishikawa Prefecture > Kanazawa (0.04)

Genre:
- Research Report (0.50)

Industry:
- Education (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Agents (1.00)
  - Machine Learning > Learning Graphical Models
    - Undirected Networks > Markov Models (0.83)
    - Directed Networks > Bayesian Learning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found