Unveiling Modality Bias: Automated Sample-Specific Analysis for Multimodal Misinformation Benchmarks
Lin, Hehai, Liu, Hui, Cao, Shilei, Li, Jing, Li, Haoliang, Wang, Wenya
–arXiv.org Artificial Intelligence
Numerous multimodal misinformation benchmarks exhibit bias toward specific modalities, allowing detectors to make predictions based solely on one modality. While previous research has quantified bias at the dataset level or manually identified spurious correlations between modalities and labels, these approaches lack meaningful insights at the sample level and struggle to scale to the vast amount of online information. In this paper, we investigate the design for automated recognition of modality bias at the sample level. Specifically, we propose three bias quantification methods based on theories/views of different levels of granularity: 1) a coarse-grained evaluation of modality benefit; 2) a medium-grained quantification of information flow; and 3) a fine-grained causality analysis. T o verify the effectiveness, we conduct a human evaluation on two popular benchmarks. Experimental results reveal three interesting findings that provide potential direction toward future research: 1) Ensembling multiple views is crucial for reliable automated analysis; 2) Automated analysis is prone to detector-induced fluctuations; and 3) Different views produce a higher agreement on modality-balanced samples but diverge on biased ones.
arXiv.org Artificial Intelligence
Nov-11-2025
- Country:
- Africa > Central African Republic
- Ombella-M'Poko > Bimbo (0.04)
- Asia > China
- Guangdong Province > Guangzhou (0.04)
- Heilongjiang Province > Harbin (0.04)
- Hong Kong (0.04)
- Europe > Netherlands
- North Holland > Amsterdam (0.04)
- North America > United States (0.04)
- Africa > Central African Republic
- Genre:
- Research Report (0.81)
- Technology:
- Information Technology
- Artificial Intelligence
- Machine Learning > Neural Networks
- Deep Learning (0.68)
- Natural Language > Large Language Model (0.94)
- Representation & Reasoning (0.93)
- Vision (1.00)
- Machine Learning > Neural Networks
- Communications > Social Media (1.00)
- Data Science > Data Mining (1.00)
- Sensing and Signal Processing > Image Processing (0.92)
- Artificial Intelligence
- Information Technology