Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning
Mingde Zhao, Safa Alver, Harm van Seijen, Romain Laroche, Doina Precup, Yoshua Bengio
arXiv.org Artificial Intelligence
Inspired by human conscious planning, we propose Skipper, a model-based reinforcement learning agent utilizing spatio-temporal abstractions to generalize learned skills in novel situations. It automatically decomposes the given task into smaller, more manageable subtasks, and hence enables sparse decision-making and focused computation on the relevant parts of the environment. This relies on the extraction of an abstracted proxy problem represented as a directed graph, in which vertices and edges are learned end-to-end from hindsight. Our theoretical analyses provide performance guarantees under appropriate assumptions and establish where our approach is expected to be helpful. Generalization-focused experiments validate Skipper's significant advantage in zero-shot generalization, compared to existing state-of-the-art hierarchical planning methods.
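As a rough illustration of planning over a proxy problem represented as a directed graph, the following minimal sketch runs value iteration on a tiny hand-written graph and picks the next vertex to pursue. It is not the authors' implementation: the edge annotations (an estimated cumulative reward and an estimated cumulative discount per edge), the function name `plan_next_checkpoint`, and the example graph are all assumptions made for illustration, and the estimates are fixed by hand here rather than learned.

```python
from typing import Dict, Hashable, Optional, Tuple

# Hypothetical edge annotation: (estimated cumulative reward, estimated cumulative discount)
# for travelling from one vertex of the proxy graph to another.
EdgeEstimates = Dict[Tuple[Hashable, Hashable], Tuple[float, float]]


def plan_next_checkpoint(
    edges: EdgeEstimates, start: Hashable, goal: Hashable, sweeps: int = 50
) -> Tuple[Optional[Hashable], Dict[Hashable, float]]:
    """Run value iteration on a small directed graph and return the best
    successor of `start` together with the vertex values."""
    vertices = {start, goal}
    for src, dst in edges:
        vertices.add(src)
        vertices.add(dst)

    # The goal is treated as terminal, so its value stays at zero.
    values = {v: 0.0 for v in vertices}
    for _ in range(sweeps):
        for u in vertices:
            if u == goal:
                continue
            backups = [
                reward + discount * values[dst]
                for (src, dst), (reward, discount) in edges.items()
                if src == u
            ]
            if backups:
                values[u] = max(backups)

    # Greedy choice of the next vertex from the start vertex.
    best_value, best_next = float("-inf"), None
    for (src, dst), (reward, discount) in edges.items():
        if src == start:
            candidate = reward + discount * values[dst]
            if candidate > best_value:
                best_value, best_next = candidate, dst
    return best_next, values


# Hand-written example graph (assumed numbers, for illustration only).
example_edges: EdgeEstimates = {
    ("s", "a"): (0.0, 0.9),
    ("a", "g"): (1.0, 0.9),
    ("s", "b"): (0.0, 0.5),
    ("b", "g"): (1.0, 0.5),
}
next_checkpoint, vertex_values = plan_next_checkpoint(example_edges, "s", "g")
print(next_checkpoint, vertex_values["s"])
```

In this toy graph the route through `a` loses less to discounting than the route through `b`, so the planner selects `a` as the next vertex. Decisions are made only at graph vertices, which is the sense in which decision-making over such a graph is sparse.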
Feb-4-2024
- Country:
- Oceania > Australia > New South Wales > Sydney (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > Canada > Quebec > Montreal (0.14)
- Genre:
- Research Report > Experimental Study (0.93)
- Research Report > New Finding (0.93)
- Industry:
- Education (1.00)
- Technology: