Pushdown Layers: Encoding Recursive Structure in Transformer Language Models

Open in new window