A Pilot Study on Dialogue-Level Dependency Parsing for Chinese
Jiang, Gongyao, Liu, Shuang, Zhang, Meishan, Zhang, Min
arXiv.org Artificial Intelligence
Dialogue-level dependency parsing has received insufficient attention, especially for Chinese. To address this gap, we draw on ideas from syntactic dependency and rhetorical structure theory (RST) to develop a high-quality human-annotated corpus containing 850 dialogues and 199,803 dependencies. Because such tasks suffer from high annotation costs, we investigate zero-shot and few-shot scenarios. Starting from an existing syntactic treebank, we adopt a signal-based method that transforms seen syntactic dependencies into unseen ones between elementary discourse units (EDUs), where the signals are detected by masked language modeling. In addition, we apply single-view and multi-view data selection to obtain reliable pseudo-labeled instances. Experimental results show the effectiveness of these baselines. Moreover, we discuss several crucial points about our dataset and approach.
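To make the data-selection idea more concrete, the following is a minimal sketch and not the authors' implementation. It assumes that single-view selection means keeping pseudo-labeled dependencies whose confidence reaches a threshold, and that multi-view selection means keeping dependencies on which two independent predictors agree. The `PseudoArc` structure, the function names, the threshold, and the relation labels are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class PseudoArc:
    """One pseudo-labeled dependency between two EDUs (hypothetical structure)."""
    head: int          # index of the head EDU
    dependent: int     # index of the dependent EDU
    label: str         # predicted relation label (illustrative names only)
    confidence: float  # predictor's probability for this arc


def select_single_view(arcs, threshold=0.9):
    """Assumed single-view rule: keep arcs whose confidence reaches the threshold."""
    return [a for a in arcs if a.confidence >= threshold]


def select_multi_view(view_a, view_b):
    """Assumed multi-view rule: keep arcs from view_a whose label matches view_b."""
    labels_b = {(a.head, a.dependent): a.label for a in view_b}
    return [a for a in view_a if labels_b.get((a.head, a.dependent)) == a.label]


# Two hypothetical predictors labeling the same three EDU pairs.
parser_view = [
    PseudoArc(0, 1, "elaboration", 0.95),
    PseudoArc(1, 2, "cause", 0.55),
    PseudoArc(2, 3, "condition", 0.92),
]
signal_view = [
    PseudoArc(0, 1, "elaboration", 0.80),
    PseudoArc(1, 2, "contrast", 0.70),
    PseudoArc(2, 3, "condition", 0.60),
]

confident = select_single_view(parser_view, threshold=0.9)
agreed = select_multi_view(parser_view, signal_view)
```

In this toy input, both rules discard the arc between EDUs 1 and 2: its confidence is low in the first view, and the two views disagree on its label.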
May-31-2023
- Country:
  - Africa > Middle East
    - Morocco (0.04)
  - Asia > China (0.04)
  - Europe
    - Bulgaria > Sofia City Province
      - Sofia (0.04)
    - France > Provence-Alpes-Côte d'Azur
      - Bouches-du-Rhône > Marseille (0.04)
    - Italy > Tuscany
      - Florence (0.04)
    - Portugal > Lisbon
      - Lisbon (0.04)
    - Slovenia (0.04)
    - Spain > Catalonia
      - Barcelona Province > Barcelona (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
  - North America
    - Dominican Republic (0.04)
    - United States
      - California > Los Angeles County
        - Los Angeles (0.04)
      - Maryland > Baltimore (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - New Jersey (0.04)
      - New Mexico > Santa Fe County
        - Santa Fe (0.04)
  - Oceania > Australia
- Genre:
  - Research Report > New Finding (0.34)
- Technology: