Better Benchmarking LLMs for Zero-Shot Dependency Parsing

Ana Ezquerro, Carlos Gómez-Rodríguez, David Vilares

Feb-28-2025, arXiv.org Artificial Intelligence

While LLMs excel at zero-shot tasks, their performance on linguistic challenges such as syntactic parsing has received less scrutiny. This paper evaluates state-of-the-art open-weight LLMs on zero-shot dependency parsing by comparing them to baselines that have no access to the input sentence, including baselines not previously used in this context, such as random projective trees and optimal linear arrangements. The results show that most of the tested LLMs cannot outperform the best uninformed baselines; only the newest and largest versions of LLaMA do so for most languages, and even they achieve rather low performance. Thus, accurate zero-shot syntactic parsing is not forthcoming with open LLMs.
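
To make the idea of an uninformed baseline concrete, here is a minimal Python sketch of a random projective tree generator that sees only the sentence length, never the words. It is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the sampler is a simple recursive split that is not uniform over all projective trees, and the unlabeled attachment score (UAS) helper is a standard parsing metric assumed here for scoring.

```python
import random


def random_projective_heads(n, rng):
    """Return a head list for words 1..n (0 = artificial root).

    Hypothetical sampler: pick a random word in the span as its head,
    then recurse on the words to its left and to its right. Every
    subtree covers a contiguous span, so the tree is projective.
    """
    heads = [0] * (n + 1)  # index 0 is unused (artificial root)

    def build(lo, hi, parent):
        if lo > hi:
            return
        r = rng.randint(lo, hi)  # inclusive on both ends
        heads[r] = parent
        build(lo, r - 1, r)
        build(r + 1, hi, r)

    build(1, n, 0)
    return heads[1:]


def is_projective(heads):
    """Check that no two arcs cross (root arcs start at position 0)."""
    arcs = [(min(h, d), max(h, d)) for d, h in enumerate(heads, start=1)]
    return not any(a < c < b < e for a, b in arcs for c, e in arcs)


def uas(pred, gold):
    """Unlabeled attachment score: fraction of words with the correct head."""
    return sum(p == g for p, g in zip(pred, gold)) / len(gold)
```

A baseline of this kind would be scored by generating one tree per test sentence from its length alone and averaging `uas` against the gold heads. An LLM parser that fails to beat that score is not extracting usable syntactic information from the input.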

baseline, computational linguistic, linguistic



Genre:
- Research Report > New Finding (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)
