Predicate-Argument Structure Divergences in Chinese and English Parallel Sentences and their Impact on Language Transfer
–arXiv.org Artificial Intelligence
Cross-lingual Natural Language Processing (NLP) has gained significant traction in recent years, offering practical solutions in low-resource settings by transferring linguistic knowledge from resource-rich to low-resource languages. This field leverages techniques like annotation projection and model transfer for language adaptation, supported by multilingual pre-trained language models. However, linguistic divergences hinder language transfer, especially among typologically distant languages. In this paper, we present an analysis of predicate-argument structures in parallel Chinese and English sentences. We explore the alignment and misalignment of predicate annotations, inspecting similarities and differences and proposing a categorization of structural divergences. The analysis and the categorization are supported by a qualitative and quantitative analysis of the results of an annotation projection experiment, in which, in turn, one of the two languages has been used as source language to project annotations into the corresponding parallel sentences. The results of this analysis show clearly that language transfer is asymmetric. An aspect that requires attention when it comes to selecting the source language in transfer learning applications and that needs to be investigated before any scientific claim about cross-lingual NLP is proposed.
arXiv.org Artificial Intelligence
Nov-14-2025
- Country:
- Africa
- Burkina Faso (0.04)
- Mali (0.04)
- Middle East > Egypt (0.04)
- Asia
- China > Beijing
- Beijing (0.04)
- Middle East
- Lebanon (0.04)
- Palestine (0.14)
- Qatar > Ad-Dawhah
- Doha (0.04)
- Republic of Türkiye > Istanbul Province
- Istanbul (0.04)
- UAE > Abu Dhabi Emirate
- Abu Dhabi (0.04)
- Uzbekistan (0.04)
- China > Beijing
- Europe
- Czechia > Prague (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Middle East > Republic of Türkiye
- Istanbul Province > Istanbul (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy
- Slovenia (0.04)
- Norway (0.04)
- Bulgaria > Sofia City Province
- Sofia (0.04)
- Finland (0.04)
- Denmark (0.04)
- Netherlands (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Germany
- Iceland > Capital Region
- Reykjavik (0.04)
- Austria (0.04)
- North America
- Canada
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Costa Rica (0.04)
- Cuba (0.04)
- Dominican Republic (0.04)
- El Salvador (0.04)
- United States
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- New York (0.04)
- Louisiana > Orleans Parish
- Canada
- South America
- Africa
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Government (0.92)
- Health & Medicine (0.87)
- Technology: