AITopics | ordered memory

Ordered Memory

Neural Information Processing Systems - Dec-26-2025, 01:06:59 GMT

Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cumulative probability to control the writing and erasing operations of the memory. We also introduce a new Gated Recursive Cell to compose lower-level representations into higher-level representations. We demonstrate that our model achieves strong performance on the logical inference task (Bowman et al., 2015) and the ListOps task (Nangia and Bowman, 2018). We can also interpret the model to retrieve the induced tree structure, and find that these induced structures align with the ground truth. Finally, we evaluate our model on the Stanford Sentiment Treebank tasks (Socher et al., 2013), and find that it performs comparably to the state-of-the-art methods in the literature.

electronic proceedings, name change, ordered memory, (1 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.60)

Reviews: Ordered Memory

Neural Information Processing Systems - Feb-11-2025, 23:33:42 GMT

This paper presents a novel model design/algorithm for building compositional representations of sequences when (as in natural language or code) it is presumed that the sequences have salient latent structure that can be described as a binary tree. The method performs essentially at ceiling on two existing artificial datasets that were designed for this task, neither of which had previously been solved under comparable conditions. The method also performs reasonably well on a sentiment analysis task. Pros: The method is novel and solves a couple of prominent instances of an important open problem in deep learning for NLP and similar domains with latent structure: How do we build models that can efficiently learn to build compositional representations using latent structure? This is interesting and likely to garner a reasonably large audience as a somewhat abstract/artificial result.

compositional representation, latent structure, ordered memory, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.80)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.38)

Reviews: Ordered Memory

Neural Information Processing Systems - Feb-11-2025, 23:33:31 GMT

After discussion, the reviewers reached the consensus that this paper, which presents a novel way of modelling structured memory, is worth including in the conference. The modelling aspect of the paper was of interest to the reviewers, who were furthermore reasonably confident that the method has empirical merit thanks to the experiments, both synthetic and "real world". Perhaps the main weakness of this paper is that, while the synthetic experiments prove the concepts and the sentiment analysis experiments show robustness to noisy data, further non-synthetic experiments might have better showcased applications of this method to tasks which the community cares about. For now, I find it of a sufficient standard for publication, and anticipate that further work will demonstrate whether this method stands well against other tasks... or not.

experiment, ordered memory, reviewer

Technology:

Information Technology > Communications > Social Media (0.32)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.32)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.32)

Ordered Memory

Neural Information Processing Systems - Jan-27-2025, 10:40:01 GMT

Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cumulative probability to control the writing and erasing operations of the memory. We also introduce a new Gated Recursive Cell to compose lower-level representations into higher-level representations.

ordered memory, representation

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.64)

Beam Tree Recursive Cells

Chowdhury, Jishnu Ray, Caragea, Cornelia

arXiv.org Artificial Intelligence - Jun-20-2023

We propose Beam Tree Recursive Cell (BT-Cell) - a backpropagation-friendly framework to extend Recursive Neural Networks (RvNNs) with beam search for latent structure induction. We further extend this framework by proposing a relaxation of the hard top-k operators in beam search for better propagation of gradient signals. We evaluate our proposed models on different out-of-distribution splits of both synthetic and realistic data. Our experiments show that BT-Cell achieves near-perfect performance on several challenging structure-sensitive synthetic tasks like ListOps and logical inference while maintaining comparable performance on realistic data against other RvNN-based models. Additionally, we identify a previously unknown failure case for neural models in generalization to an unseen number of arguments in ListOps. The code is available at: https://github.com/JRC1995/BeamTreeRecursiveCells.

computational linguistic, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2305.19999

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(19 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.94)

Ordered Memory Baselines

Borisov, Daniel, D'Iorio, Matthew, Hyacinthe, Jeffrey

arXiv.org Artificial Intelligence - Feb-8-2023

Natural language semantics can be modeled using the phrase-structured model, which can be represented using a tree-type architecture. As a result, recent advances in natural language processing have been made using recursive neural networks with memory models that allow them to infer tree-type representations of the input sentence sequence. These new tree models have allowed for improvements in sentiment analysis and semantic recognition. Here we review the Ordered Memory model proposed by Shen et al. (2019) at the NeurIPS 2019 conference, and try to either create baselines that can perform better or create simpler models that can perform equally well. We found that the Ordered Memory model performs on par with the state-of-the-art models used in tree-type modelling, and performs better than simplified baselines that require fewer parameters.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2302.06451

Country: North America > Canada > Quebec > Montreal (0.47)

Genre:

Research Report (0.70)
Overview (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Ordered Memory

Shen, Yikang, Tan, Shawn, Hosseini, Arian, Lin, Zhouhan, Sordoni, Alessandro, Courville, Aaron C.

Neural Information Processing Systems - Mar-18-2020, 22:32:12 GMT

Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this paper, we propose the Ordered Memory architecture. Inspired by Ordered Neurons (Shen et al., 2018), we introduce a new attention-based mechanism and use its cumulative probability to control the writing and erasing operations of the memory. We also introduce a new Gated Recursive Cell to compose lower-level representations into higher-level representations.

ordered memory, representation

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.64)
