A Rationale-centric Counterfactual Data Augmentation Method for Cross-Document Event Coreference Resolution
Ding, Bowen, Min, Qingkai, Ma, Shengkun, Li, Yingjie, Yang, Linyi, Zhang, Yue
arXiv.org Artificial Intelligence
Based on Pre-trained Language Models (PLMs), event coreference resolution (ECR) systems have demonstrated outstanding performance in clustering coreferential events across documents. However, the state-of-the-art system exhibits an excessive reliance on the 'triggers lexical matching' spurious pattern in the input mention pair text. We formalize the decision-making process of the baseline ECR system using a Structural Causal Model (SCM), aiming to identify spurious and causal associations (i.e., rationales) within the ECR task. Leveraging the debiasing capability of counterfactual data augmentation, we develop a rationale-centric counterfactual data augmentation method with LLM-in-the-loop. This method is specialized for pairwise input in the ECR system, where we conduct direct interventions on triggers and context to mitigate the spurious association while emphasizing the causation.
Figure 1: The distribution of 'triggers lexical matching' in mention pairs from the ECB+ training set, along with a false negative example from Held et al.'s system.
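The two ideas in the abstract, measuring how often triggers match lexically and intervening on a trigger to break that match, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the mention pairs, the field names (`text_a`, `trigger_a`, `coref`), and both helper functions are hypothetical, and the replacement trigger is passed in by hand where the paper uses an LLM in the loop. Whether the label survives the edit is also an assumption that the caller must check.

```python
from collections import Counter

# Toy mention pairs, invented for illustration (not taken from ECB+).
PAIRS = [
    {"text_a": "The quake struck Aceh on Tuesday.", "trigger_a": "struck",
     "text_b": "An earthquake hit Aceh province.", "trigger_b": "hit",
     "coref": True},
    {"text_a": "The actor died in Minneapolis.", "trigger_a": "died",
     "text_b": "The singer died in Kansas City.", "trigger_b": "died",
     "coref": False},
    {"text_a": "Police arrested the suspect.", "trigger_a": "arrested",
     "text_b": "Officers arrested the suspect at home.", "trigger_b": "arrested",
     "coref": True},
]


def triggers_match(pair):
    """True when both triggers are the same string, ignoring case."""
    return pair["trigger_a"].lower() == pair["trigger_b"].lower()


def match_distribution(pairs):
    """Count pairs by (coreference label, whether the triggers match)."""
    return Counter((p["coref"], triggers_match(p)) for p in pairs)


def intervene_on_trigger(pair, side, new_trigger):
    """Return a copy of `pair` with the trigger on `side` ('a' or 'b') replaced.

    The label is copied unchanged; this is only valid when `new_trigger`
    describes the same event (an assumption the caller is responsible for).
    """
    old_trigger = pair["trigger_" + side]
    edited = dict(pair)
    edited["text_" + side] = pair["text_" + side].replace(old_trigger, new_trigger, 1)
    edited["trigger_" + side] = new_trigger
    return edited


dist = match_distribution(PAIRS)
augmented = intervene_on_trigger(PAIRS[2], "b", "detained")
```

Here `augmented` is still labeled coreferential but its triggers no longer match, which is the kind of example that weakens a model's dependence on lexical matching alone.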
May-8-2024
- Country:
  - Asia > Indonesia
    - New Guinea > Western New Guinea (0.14)
  - Europe (1.00)
  - North America > United States
    - Minnesota > Hennepin County
      - Minneapolis (0.14)
    - Missouri > Jackson County
      - Kansas City (0.14)
- Genre:
  - Personal > Obituary (1.00)
  - Research Report (1.00)
- Industry:
  - Health & Medicine (0.68)
  - Information Technology > Security & Privacy (1.00)
  - Leisure & Entertainment > Sports
  - Media > Film (0.67)
- Technology: