Better Benchmarking LLMs for Zero-Shot Dependency Parsing
Ezquerro, Ana, Gómez-Rodríguez, Carlos, Vilares, David
–arXiv.org Artificial Intelligence
While LLMs excel in zero-shot tasks, their performance in linguistic challenges like syntactic parsing has been less scrutinized. This paper studies state-of-the-art open-weight LLMs on the task by comparing them to baselines that do not have access to the input sentence, including baselines that have not been used in this context such as random projective trees or optimal linear arrangements. The results show that most of the tested LLMs cannot outperform the best uninformed baselines, with only the newest and largest versions of LLaMA doing so for most languages, and still achieving rather low performance. Thus, accurate zero-shot syntactic parsing is not forthcoming with open LLMs.
arXiv.org Artificial Intelligence
Feb-28-2025
- Country:
- Asia
- British Indian Ocean Territory > Diego Garcia (0.04)
- China > Hong Kong (0.04)
- Middle East
- Jordan (0.04)
- Saudi Arabia > Asir Province
- Abha (0.04)
- Singapore (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Bulgaria > Sofia City Province
- Sofia (0.04)
- France > Île-de-France
- Spain
- Catalonia > Barcelona Province
- Barcelona (0.04)
- Galicia > A Coruña Province
- A Coruña (0.04)
- Catalonia > Barcelona Province
- Ukraine > Kyiv Oblast
- Kyiv (0.04)
- Bulgaria > Sofia City Province
- North America
- Canada > Ontario
- Toronto (0.04)
- Dominican Republic (0.04)
- Mexico > Mexico City
- Mexico City (0.04)
- Canada > Ontario
- Asia
- Genre:
- Research Report > New Finding (1.00)
- Technology: