GUMBridge: a Corpus for Varieties of Bridging Anaphora
arXiv.org Artificial Intelligence
Bridging is an anaphoric phenomenon in which the interpretation of an entity in a discourse depends on a previous, non-identical entity, as in "There is 'a house'. 'The door' is red," where the door is understood to be the door of the aforementioned house. While several English resources for bridging anaphora exist, most are small, cover the phenomenon only partially, and/or span few genres. In this paper, we introduce GUMBridge, a new resource for bridging that includes 16 diverse genres of English, providing both broad coverage of the phenomenon and granular annotations for the subtype categorization of bridging varieties. We also present an evaluation of annotation quality and report baseline performance of contemporary open- and closed-source LLMs on three tasks underlying our data, showing that bridging resolution and subtype classification remain difficult NLP tasks in the age of LLMs.
Dec-9-2025