Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning

Zhao, Mingde, Alver, Safa, van Seijen, Harm, Laroche, Romain, Precup, Doina, Bengio, Yoshua

Feb-4-2024–arXiv.org Artificial Intelligence

Inspired by human conscious planning, we propose Skipper, a model-based reinforcement learning agent utilizing spatio-temporal abstractions to generalize learned skills in novel situations. It automatically decomposes the given task into smaller, more manageable subtasks, and hence enables sparse decision-making and focused computation on the relevant parts of the environment. This relies on the extraction of an abstracted proxy problem represented as a directed graph, in which vertices and edges are learned end-to-end from hindsight. Our theoretical analyses provide performance guarantees under appropriate assumptions and establish where our approach is expected to be helpful. Generalization-focused experiments validate Skipper's significant advantage in zero-shot generalization, compared to existing state-of-the-art hierarchical planning methods.

agent, checkpoint, diff 0, (13 more...)

arXiv.org Artificial Intelligence

Feb-4-2024

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - New South Wales > Sydney (0.04)
- North America
  - United States
    - New York > New York County
      - New York City (0.04)
    - California > San Francisco County
      - San Francisco (0.14)
  - Canada > Quebec
    - Montreal (0.14)

Genre:
- Research Report
  - Experimental Study (0.93)
  - New Finding (0.93)

Industry:
- Education (1.00)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning
  - Reinforcement Learning (1.00)
  - Neural Networks > Deep Learning (0.46)