Bridging Discourse Treebanks with a Unified Rhetorical Structure Parser
–arXiv.org Artificial Intelligence
We introduce UniRST, the first unified RST-style discourse parser capable of handling 18 treebanks in 11 languages without modifying their relation inventories. To overcome inventory incompatibilities, we propose and evaluate two training strategies: Multi-Head, which assigns separate relation classification layer per inventory, and Masked-Union, which enables shared parameter training through selective label masking. We first benchmark monotreebank parsing with a simple yet effective augmentation technique for low-resource settings. We then train a unified model and show that (1) the parameter efficient Masked-Union approach is also the strongest, and (2) UniRST outperforms 16 of 18 mono-treebank baselines, demonstrating the advantages of a single-model, multilingual end-to-end discourse parsing across diverse resources.
arXiv.org Artificial Intelligence
Oct-9-2025
- Country:
- Asia
- China > Hong Kong (0.04)
- Japan > Honshū
- Kansai > Kyoto Prefecture > Kyoto (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Germany > Brandenburg
- Potsdam (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Croatia > Dubrovnik-Neretva County
- North America
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- United States
- California (0.14)
- Georgia > Fulton County
- Atlanta (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Oregon > Multnomah County
- Portland (0.04)
- Canada > Ontario
- Oceania > Australia
- South America > Brazil
- Ceará > Fortaleza (0.04)
- Mato Grosso > Cuiabá (0.04)
- Asia
- Genre:
- Research Report > New Finding (0.46)
- Technology: