Learning discrete state abstractions with deep variational inference

Biza, Ondrej, Platt, Robert, van de Meent, Jan-Willem, Wong, Lawson L. S.

Mar-9-2020–arXiv.org Machine Learning

Abstraction is crucial for effective sequential decision making in domains with large state spaces. In this work, we propose a variational information bottleneck method for learning approximate bisimulations, a type of state abstraction. We use a deep neural net encoder to map states onto continuous embeddings. The continuous latent space is then compressed into a discrete representation using an action-conditioned hidden Markov model, which is trained end-to-end with the neural network. Our method is suited for environments with high-dimensional states and learns from a stream of experience collected by an agent acting in a Markov decision process. Through a learned discrete abstract model, we can efficiently plan for unseen goals in a multi-goal Reinforcement Learning setting. We test our method in simplified robotic manipulation domains with image states. We also compare it against previous model-based approaches to finding bisimulations in discrete grid-world-like environments.

abstraction, bisimulation, state abstraction, (15 more...)

arXiv.org Machine Learning

Mar-9-2020

arXiv.org PDF

Add feedback

Country:
- Asia > China (0.04)
- North America
  - Puerto Rico (0.04)
  - United States
    - New York > New York County
      - New York City (0.14)
    - Massachusetts > Suffolk County
      - Boston (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Canada > Alberta
    - Census Division No. 15 > Improvement District No. 9 > Banff (0.04)
- Europe > France
  - Hauts-de-France > Nord > Lille (0.04)

Genre:
- Research Report (0.50)

Industry:
- Leisure & Entertainment (0.46)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Neural Networks (1.00)
  - Learning Graphical Models > Undirected Networks
    - Markov Models (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found