Unsupervised Learning of Temporal Abstractions with Slot-based Transformers
Gopalakrishnan, Anand; Irie, Kazuki; Schmidhuber, Jürgen; van Steenkiste, Sjoerd
arXiv.org Artificial Intelligence
The discovery of reusable sub-routines simplifies decision-making and planning in complex reinforcement learning problems. Previous approaches propose to learn such temporal abstractions in a purely unsupervised fashion by observing state-action trajectories gathered from executing a policy. However, a current limitation is that they process each trajectory in an entirely sequential manner, which prevents them from revising earlier decisions about sub-routine boundary points in light of new incoming information. In this work we propose SloTTAr, a fully parallel approach that integrates sequence-processing Transformers with a Slot Attention module and adaptive computation to learn the number of such sub-routines in an unsupervised fashion. We demonstrate that SloTTAr outperforms strong baselines in terms of boundary point discovery, even for sequences containing a variable number of sub-routines, while being up to 7x faster to train on existing benchmarks.
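As an illustration only, and not the authors' implementation, the notion of "boundary points" can be sketched by assuming a model has already produced one soft mask per slot over the time steps of a trajectory. The function name `boundaries_from_masks`, the `masks[k][t]` layout, and the argmax read-out are assumptions made for this example. Each step is assigned to its highest-weight slot, and a boundary is any step where that assignment changes.

```python
from typing import List


def boundaries_from_masks(masks: List[List[float]]) -> List[int]:
    """Return time steps at which the dominant slot changes.

    masks[k][t] is the weight slot k places on time step t
    (a hypothetical layout chosen for this sketch).
    """
    num_slots = len(masks)
    num_steps = len(masks[0])
    # Assign each time step to the slot with the largest weight.
    labels = [
        max(range(num_slots), key=lambda k: masks[k][t])
        for t in range(num_steps)
    ]
    # A boundary is any step whose label differs from the previous step's.
    return [t for t in range(1, num_steps) if labels[t] != labels[t - 1]]


# Toy masks for 3 slots over 6 time steps (made-up numbers).
masks = [
    [0.9, 0.8, 0.7, 0.1, 0.1, 0.0],
    [0.1, 0.2, 0.3, 0.8, 0.7, 0.1],
    [0.0, 0.0, 0.0, 0.1, 0.2, 0.9],
]
print(boundaries_from_masks(masks))  # prints [3, 5]
```

Because all masks are available at once, every boundary is read off from the whole trajectory jointly, in contrast to a sequential scan that must commit to a boundary before seeing later steps.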
Nov-22-2022
- Country:
  - Asia > Middle East
    - Qatar > Ad-Dawhah
      - Doha (0.04)
    - Saudi Arabia > Mecca Province
      - Thuwal (0.04)
  - Europe
    - France (0.04)
    - Germany > North Rhine-Westphalia
      - Cologne Region > Bonn (0.04)
    - Netherlands > North Holland
      - Amsterdam (0.04)
    - Sweden > Stockholm
      - Stockholm (0.04)
    - Switzerland (0.04)
    - United Kingdom > England
      - Oxfordshire > Oxford (0.04)
  - North America
    - Canada
    - United States
      - California
        - Los Angeles County > Long Beach (0.04)
        - San Francisco County > San Francisco (0.14)
      - Colorado > Denver County
        - Denver (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - New York
        - Bronx County > New York City (0.04)
        - Kings County > New York City (0.04)
        - New York County > New York City (0.04)
        - Queens County > New York City (0.04)
        - Richmond County > New York City (0.04)
      - Pennsylvania > Allegheny County
        - Pittsburgh (0.04)
      - Tennessee > Davidson County
        - Nashville (0.04)
  - Oceania > Australia
    - New South Wales > Sydney (0.04)
- Genre:
  - Research Report (1.00)
- Industry:
  - Education > Focused Education (0.34)
- Technology: