Dual-Teacher De-biasing Distillation Framework for Multi-domain Fake News Detection
Li, Jiayang, Feng, Xuan, Gu, Tianlong, Chang, Liang
–arXiv.org Artificial Intelligence
Multi-domain fake news detection aims to identify whether various news from different domains is real or fake and has become urgent and important. However, existing methods are dedicated to improving the overall performance of fake news detection, ignoring the fact that unbalanced data leads to disparate treatment for different domains, i.e., the domain bias problem. To solve this problem, we propose the Dual-Teacher De-biasing Distillation framework (DTDBD) to mitigate bias across different domains. Following the knowledge distillation methods, DTDBD adopts a teacher-student structure, where pre-trained large teachers instruct a student model. In particular, the DTDBD consists of an unbiased teacher and a clean teacher that jointly guide the student model in mitigating domain bias and maintaining performance. For the unbiased teacher, we introduce an adversarial de-biasing distillation loss to instruct the student model in learning unbiased domain knowledge. For the clean teacher, we design domain knowledge distillation loss, which effectively incentivizes the student model to focus on representing domain features while maintaining performance. Moreover, we present a momentum-based dynamic adjustment algorithm to trade off the effects of two teachers. Extensive experiments on Chinese and English datasets show that the proposed method substantially outperforms the state-of-the-art baseline methods in terms of bias metrics while guaranteeing competitive performance.
arXiv.org Artificial Intelligence
Dec-1-2023
- Genre:
- Research Report (1.00)
- Industry:
- Education (1.00)
- Health & Medicine > Therapeutic Area
- Immunology (0.46)
- Infections and Infectious Diseases (0.46)
- Media > News (1.00)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (0.46)
- Performance Analysis > Accuracy (0.47)
- Natural Language (1.00)
- Machine Learning
- Communications (1.00)
- Data Science > Data Mining (1.00)
- Information Management (0.93)
- Knowledge Management (0.93)
- Artificial Intelligence
- Information Technology