Recursive Algorithmic Reasoning

Jürß, Jonas, Jayalath, Dulhan, Veličković, Petar

Nov-20-2023–arXiv.org Artificial Intelligence

Learning models that execute algorithms can enable us to address a key problem in deep learning: generalizing to out-of-distribution data. However, neural networks are currently unable to execute recursive algorithms because they do not have arbitrarily large memory to store and recall state. To address this, we (1) propose a way to augment graph neural networks (GNNs) with a stack, and (2) develop an approach for capturing intermediate algorithm trajectories that improves algorithmic alignment with recursive algorithms over previous methods. The stack allows the network to learn to store and recall a portion of the state of the network at a particular time, analogous to the action of a call stack in a recursive algorithm. This augmentation permits the network to reason recursively. We empirically demonstrate that our proposals significantly improve generalization to larger input graphs over prior work on depth-first search (DFS).

algorithm, neural network, node, (15 more...)

arXiv.org Artificial Intelligence

Nov-20-2023

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Maryland > Baltimore (0.04)
    - Massachusetts > Middlesex County
      - Cambridge (0.04)
    - Hawaii > Honolulu County
      - Honolulu (0.04)
    - California > San Diego County
      - San Diego (0.04)
  - Puerto Rico > San Juan
    - San Juan (0.04)
- Europe
  - France (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.14)
    - Oxfordshire > Oxford (0.04)
  - Germany > North Rhine-Westphalia
    - Upper Bavaria > Munich (0.04)
- Asia > China
  - Beijing > Beijing (0.04)
- Africa
  - Rwanda > Kigali
    - Kigali (0.04)
  - Ethiopia > Addis Ababa
    - Addis Ababa (0.04)

Genre:
- Workflow (0.68)
- Research Report (0.50)

Technology:
- Information Technology
  - Software > Programming Languages (1.00)
  - Artificial Intelligence > Machine Learning
    - Neural Networks > Deep Learning (0.66)