Investigating Failures to Generalize for Coreference Resolution Models
Porada, Ian, Olteanu, Alexandra, Suleman, Kaheer, Trischler, Adam, Cheung, Jackie Chi Kit
–arXiv.org Artificial Intelligence
Coreference resolution models are often evaluated on multiple datasets. Datasets vary, however, in how coreference is realized -- i.e., how the theoretical concept of coreference is operationalized in the dataset -- due to factors such as the choice of corpora and annotation guidelines. We investigate the extent to which errors of current coreference resolution models are associated with existing differences in operationalization across datasets (OntoNotes, PreCo, and Winogrande). Specifically, we distinguish between and break down model performance into categories corresponding to several types of coreference, including coreferring generic mentions, compound modifiers, and copula predicates, among others. This break down helps us investigate how state-of-the-art models might vary in their ability to generalize across different coreference types. In our experiments, for example, models trained on OntoNotes perform poorly on generic mentions and copula predicates in PreCo. Our findings help calibrate expectations of current coreference resolution models; and, future work can explicitly account for those types of coreference that are empirically associated with poor generalization when developing models.
arXiv.org Artificial Intelligence
Mar-16-2023
- Country:
- North America
- El Salvador (0.14)
- Dominican Republic (0.04)
- Costa Rica (0.04)
- United States
- Maryland (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Colorado > Denver County
- Denver (0.04)
- California > San Diego County
- San Diego (0.04)
- Canada
- Quebec > Montreal (0.14)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.04)
- Europe
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Russia > Central Federal District
- Moscow Oblast > Moscow (0.04)
- Italy > Lazio
- Rome (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Sweden > Vaestra Goetaland
- Asia
- South Korea (0.04)
- China > Hong Kong (0.04)
- Taiwan (0.04)
- Middle East > Jordan (0.04)
- Singapore (0.04)
- Russia (0.04)
- Africa > Middle East
- Morocco (0.04)
- North America
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Health & Medicine (0.47)
- Government (0.46)
- Technology: