Ordered Memory

Shen, Yikang, Tan, Shawn, Hosseini, Arian, Lin, Zhouhan, Sordoni, Alessandro, Courville, Aaron C.

Neural Information Processing Systems 

Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cumulative probability to control the writing and erasing operation of the memory. We also introduce a new Gated Recursive Cell to compose lower-level representations into higher-level representation.