shortcut token
DBR: Divergence-Based Regularization for Debiasing Natural Language Understanding Models
Li, Zihao, Tang, Ruixiang, Cheng, Lu, Wang, Shuaiqiang, Yin, Dawei, Du, Mengnan
Pre-trained language models (PLMs) have achieved impressive results on various natural language processing tasks. However, recent research has revealed that these models often rely on superficial features and shortcuts instead of developing a genuine understanding of language, especially for natural language understanding (NLU) tasks. Consequently, the models struggle to generalize to out-of-domain data. In this work, we propose Divergence Based Regularization (DBR) to mitigate this shortcut learning behavior. Our method measures the divergence between the output distributions for original examples and examples where shortcut tokens have been masked. This process prevents the model's predictions from being overly influenced by shortcut features or biases. We evaluate our model on three NLU tasks and find that it improves out-of-domain performance with little loss of in-domain accuracy. Our results demonstrate that reducing the reliance on shortcuts and superficial features can enhance the generalization ability of large pre-trained language models.
- Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
- North America > United States > New Jersey (0.04)
- North America > United States > Illinois > Cook County > Chicago (0.04)
Towards Faithful Explanations: Boosting Rationalization with Shortcuts Discovery
Yue, Linan, Liu, Qi, Du, Yichao, Wang, Li, Gao, Weibo, An, Yanqing
The remarkable success in neural networks provokes the selective rationalization. It explains the prediction results by identifying a small subset of the inputs sufficient to support them. Since existing methods still suffer from adopting the shortcuts in data to compose rationales and limited large-scale annotated rationales by human, in this paper, we propose a Shortcuts-fused Selective Rationalization (SSR) method, which boosts the rationalization by discovering and exploiting potential shortcuts. Specifically, SSR first designs a shortcuts discovery approach to detect several potential shortcuts. Then, by introducing the identified shortcuts, we propose two strategies to mitigate the problem of utilizing shortcuts to compose rationales. Finally, we develop two data augmentations methods to close the gap in the number of annotated rationales. Extensive experimental results on real-world datasets clearly validate the effectiveness of our proposed method.
- South America > Colombia > Meta Department > Villavicencio (0.04)
- North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
- Europe > France (0.04)
- Asia > China > Anhui Province (0.04)
- Media > Film (1.00)
- Leisure & Entertainment (1.00)