Compositional Generalisation with Structured Reordering and Fertility Layers
Lindemann, Matthias, Koller, Alexander, Titov, Ivan
–arXiv.org Artificial Intelligence
Seq2seq models have been shown to struggle with compositional generalisation, i.e. generalising to new and potentially more complex structures than seen during training. Taking inspiration from grammar-based models that excel at compositional generalisation, we present a flexible end-to-end differentiable neural model that composes two structural operations: a fertility step, which we introduce in this work, and a reordering step based on previous work (Wang et al., 2021). To ensure differentiability, we use the expected value of each step. Our model outperforms seq2seq models by a wide margin on challenging compositional splits of realistic semantic parsing tasks that require generalisation to longer examples. It also compares favourably to other models targeting compositional generalisation.
arXiv.org Artificial Intelligence
Feb-15-2023
- Country:
- South America > Chile
- Oceania > Australia
- North America
- Dominican Republic (0.04)
- Canada (0.04)
- United States
- Texas (0.04)
- New York (0.04)
- New Jersey (0.04)
- California > San Diego County
- San Diego (0.04)
- Europe
- France (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Germany
- Asia
- Middle East > Qatar
- China > Guangxi Province
- Nanning (0.04)
- Genre:
- Research Report (0.64)
- Technology: