Understanding Syntactic Generalization in Structure-inducing Language Models
Arps, David, Sajjad, Hassan, Kallmeyer, Laura
–arXiv.org Artificial Intelligence
Structure-inducing Language Models (SiLM) are trained on a self-supervised language modeling task, and induce a hierarchical sentence representation as a byproduct when processing an input. SiLMs couple strong syntactic generalization behavior with competitive performance on various NLP tasks, but many of their basic properties are yet underexplored. In this work, we train three different SiLM architectures from scratch: Structformer (Shen et al., 2021), UDGN (Shen et al., 2022), and GPST (Hu et al., 2024b). We train these architectures on both natural language (English, German, and Chinese) corpora and synthetic bracketing expressions. The models are then evaluated with respect to (i) properties of the induced syntactic representations (ii) performance on grammaticality judgment tasks, and (iii) training dynamics. We find that none of the three architectures dominates across all evaluation metrics. However, there are significant differences, in particular with respect to the induced syntactic representations. The Generative Pretrained Structured Transformer (GPST; Hu et al. 2024) performs most consistently across evaluation settings, and outperforms the other models on long-distance dependencies in bracketing expressions. Furthermore, our study shows that small models trained on large amounts of synthetic data provide a useful testbed for evaluating basic model properties.
arXiv.org Artificial Intelligence
Dec-9-2025
- Country:
- Asia
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Occitanie
- Haute-Garonne > Toulouse (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Slovenia (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany
- Bavaria > Upper Bavaria
- Munich (0.04)
- Berlin (0.04)
- North Rhine-Westphalia > Düsseldorf Region
- Düsseldorf (0.04)
- Bavaria > Upper Bavaria
- Italy > Tuscany
- Florence (0.04)
- Austria > Vienna (0.14)
- Belgium > Brussels-Capital Region
- North America
- Canada
- Nova Scotia > Halifax Regional Municipality
- Halifax (0.04)
- Ontario > Toronto (0.04)
- Nova Scotia > Halifax Regional Municipality
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- Colorado > Denver County
- Denver (0.04)
- Florida > Miami-Dade County
- Miami (0.04)
- Maryland (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.14)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Pennsylvania > Philadelphia County
- Philadelphia (0.04)
- Colorado > Denver County
- Canada
- Oceania > Australia
- Genre:
- Research Report (0.82)
- Technology: