Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing
Sandhan, Jivnesh, Behera, Laxmidhar, Goyal, Pawan
–arXiv.org Artificial Intelligence
In this work, we focus on low-resource dependency parsing for multiple languages. Several strategies are tailored to enhance performance in low-resource scenarios. While these are well-known to the community, it is not trivial to select the best-performing combination of these strategies for a low-resource language that we are interested in, and not much attention has been given to measuring the efficacy of these strategies. We experiment with 5 low-resource strategies for our ensembled approach on 7 Universal Dependency (UD) low-resource languages. Our exhaustive experimentation on these languages supports the effective improvements for languages not covered in pretrained models. We show a successful application of the ensembled system on a truly low-resource language Sanskrit. The code and data are available at: https://github.com/Jivnesh/SanDP
arXiv.org Artificial Intelligence
Jan-29-2023
- Country:
- Oceania > Australia
- North America
- Europe
- Italy > Tuscany
- Florence (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.05)
- Italy > Tuscany
- Asia
- South Korea (0.04)
- Indonesia > Bali (0.04)
- Middle East > Qatar
- India > West Bengal
- Kharagpur (0.04)
- China
- Genre:
- Research Report (0.82)
- Technology: