Exploring Multiple Strategies to Improve Multilingual Coreference Resolution in CorefUD
Pražák, Ondřej, Konopík, Miloslav
–arXiv.org Artificial Intelligence
Coreference resolution is the task of identifying language expressions that refer to the same real-world entity (antecedent) within a text. These coreferential expressions can sometimes appear within a single sentence, but often, they are spread across multiple sentences. In some challenging cases, it is necessary to consider the entire document to determine whether two expressions refer to the same entity. The task can be divided into two main subtasks: identifying entity mentions and grouping these mentions based on the real-world entities they refer to. Coreference resolution is closely related to anaphora resolution, as discussed in [2] Historically, coreference resolution was a standard preprocessing step in various natural language processing (NLP) tasks, such as machine translation, summarization, and information extraction. Although recent large language models have achieved state-of-the-art results in coreference resolution, they are expensive to train and deploy, and traditional (discriminative) approaches remain competitive. Expressing this task in natural language is challenging, and to the best of our knowledge, there have been no successful attempts to utilize large chatbots (like ChatGPT-4) to achieve superior results. Coreference resolution becomes particularly challenging in low-resource languages. One strategy to address this challenge is to train a multilingual model on datasets from multiple languages, thereby transferring knowledge from resource-rich languages to those with fewer resources.
arXiv.org Artificial Intelligence
Aug-29-2024
- Country:
- Asia > Singapore (0.04)
- Europe
- Czechia (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Estonia > Tartu County
- Tartu (0.04)
- Faroe Islands > Streymoy
- Tórshavn (0.04)
- Germany
- Berlin (0.04)
- Brandenburg > Potsdam (0.04)
- Hungary > Csongrád-Csanád County
- Szeged (0.04)
- Spain > Valencian Community
- Valencia Province > Valencia (0.04)
- Ukraine (0.04)
- North America > United States
- Maryland > Howard County
- Columbia (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- Maryland > Howard County
- Oceania > Australia
- Genre:
- Research Report > Experimental Study (0.46)
- Technology: