Systematic Investigation of Strategies Tailored for Low-Resource Settings for Low-Resource Dependency Parsing

Sandhan, Jivnesh, Behera, Laxmidhar, Goyal, Pawan

Jan-29-2023–arXiv.org Artificial Intelligence

In this work, we focus on low-resource dependency parsing for multiple languages. Several strategies are tailored to enhance performance in low-resource scenarios. While these are well-known to the community, it is not trivial to select the best-performing combination of these strategies for a low-resource language that we are interested in, and not much attention has been given to measuring the efficacy of these strategies. We experiment with 5 low-resource strategies for our ensembled approach on 7 Universal Dependency (UD) low-resource languages. Our exhaustive experimentation on these languages supports the effective improvements for languages not covered in pretrained models. We show a successful application of the ensembled system on a truly low-resource language Sanskrit. The code and data are available at: https://github.com/Jivnesh/SanDP

computational linguistic, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

Jan-29-2023

arXiv.org PDF

Add feedback

Country:
- Oceania > Australia
  - Victoria > Melbourne (0.05)
- North America
  - United States
    - Oregon > Multnomah County
      - Portland (0.04)
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
  - Canada > British Columbia
    - Metro Vancouver Regional District > Vancouver (0.04)
- Europe
  - Italy > Tuscany
    - Florence (0.04)
  - France > Provence-Alpes-Côte d'Azur
    - Bouches-du-Rhône > Marseille (0.04)
  - Denmark > Capital Region
    - Copenhagen (0.04)
  - Belgium > Brussels-Capital Region
    - Brussels (0.05)
- Asia
  - South Korea (0.04)
  - Indonesia > Bali (0.04)
  - Middle East > Qatar
    - Ad-Dawhah > Doha (0.04)
  - India > West Bengal
    - Kharagpur (0.04)
  - China
    - Hong Kong (0.05)
    - Beijing > Beijing (0.04)

Genre:
- Research Report (0.82)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks (0.94)
  - Natural Language > Grammars & Parsing (0.67)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found