A Systematic Study of Compositional Syntactic Transformer Language Models
Zhao, Yida; Xue, Hao; Hu, Xiang; Tu, Kewei
arXiv.org Artificial Intelligence
Syntactic language models (SLMs) enhance Transformers by incorporating syntactic biases through the modeling of linearized syntactic parse trees alongside surface sentences. This paper focuses on compositional SLMs that are based on constituency parse trees and contain explicit bottom-up composition of constituent representations. We identify key aspects of design choices in existing compositional SLMs and propose a unified framework encompassing both existing models and novel variants. We conduct a comprehensive empirical evaluation of all the variants in our framework across language modeling, syntactic generalization, summarization, dialogue, and inference efficiency. Based on the experimental results, we make multiple recommendations on the design of compositional SLMs. Our code is released at https://github.com/zhaoyd1/compositional_SLMs.
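To make the abstract's two central ideas concrete — linearizing a constituency parse tree alongside the surface sentence, and composing constituent representations bottom-up — here is a minimal, hypothetical sketch. It is not the paper's implementation: the tree encoding, the bracket token names (`(NP`, `)`), and the mean-pooling stand-in for a learned composition function are all illustrative assumptions.

```python
# Hypothetical sketch (not the paper's code) of (1) linearizing a
# constituency tree into a mixed sequence of bracket and word tokens,
# and (2) bottom-up composition of constituent vectors.

def linearize(tree):
    """Flatten a nested (label, children) tree into a token sequence
    interleaving open-nonterminal, word, and close-bracket symbols."""
    label, children = tree
    if isinstance(children, str):          # leaf node: (POS, word)
        return [children]
    toks = [f"({label}"]
    for child in children:
        toks.extend(linearize(child))
    toks.append(")")
    return toks

def compose(tree, embed):
    """Bottom-up composition: each constituent's vector is a function
    of its children's vectors (elementwise mean here; a compositional
    SLM would use a learned composition network instead)."""
    label, children = tree
    if isinstance(children, str):
        return embed[children]
    child_vecs = [compose(c, embed) for c in children]
    dim = len(child_vecs[0])
    return [sum(v[i] for v in child_vecs) / len(child_vecs)
            for i in range(dim)]

# Toy parse of "the cat sat"
tree = ("S", [("NP", [("DT", "the"), ("NN", "cat")]),
              ("VP", [("VBD", "sat")])])
print(linearize(tree))
# → ['(S', '(NP', 'the', 'cat', ')', '(VP', 'sat', ')', ')']
```

In a compositional SLM, the Transformer would predict this mixed token sequence autoregressively, and the vectors produced by `compose` would replace the individual child tokens once a constituent closes.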
Jul-1-2025