Multimodal Misinformation Detection by Learning from Synthetic Data with Multimodal LLMs
Zeng, Fengzhu, Li, Wenqian, Gao, Wei, Pang, Yan
arXiv.org Artificial Intelligence
Detecting multimodal misinformation, especially in the form of image-text pairs, is crucial. Because obtaining large-scale, high-quality real-world fact-checking datasets for training detectors is costly, researchers have turned to synthetic datasets generated by AI technologies. However, it remains unclear how well detectors trained on synthetic data generalize to real-world scenarios, given the distribution gap between the two. To address this, we propose learning from synthetic data to detect real-world multimodal misinformation, using two model-agnostic data selection methods that match the synthetic and real-world data distributions. Experiments show that our method improves the performance of a small MLLM (13B) on real-world fact-checking datasets, enabling it to surpass even GPT-4V.
Sep-29-2024
- Country:
  - Africa > Rwanda
  - Asia
    - China > Hong Kong (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - Russia (0.04)
    - Singapore (0.04)
  - Europe
    - France
      - Provence-Alpes-Côte d'Azur > Bouches-du-Rhône
        - Marseille (0.04)
      - Île-de-France > Paris
        - Paris (0.14)
    - Germany > Bavaria
      - Upper Bavaria > Munich (0.04)
    - Middle East > Republic of Türkiye
      - Istanbul Province > Istanbul (0.04)
    - Netherlands (0.04)
    - Russia (0.04)
    - United Kingdom > England
      - Greater London > London (0.04)
  - North America
    - Canada
      - British Columbia > Metro Vancouver Regional District
        - Vancouver (0.04)
      - Ontario > Toronto (0.04)
      - Quebec > Montreal (0.04)
    - Dominican Republic (0.04)
    - United States
      - New York > New York County
        - New York City (0.05)
      - California
        - Los Angeles County > Los Angeles
          - Hollywood > West Hollywood (0.04)
        - Santa Clara County > Mountain View (0.04)
      - District of Columbia > Washington (0.04)
      - Washington > King County
        - Seattle (0.04)
      - Mississippi > Hinds County
        - Jackson (0.04)
      - Tennessee (0.04)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Arizona (0.04)
      - Hawaii > Honolulu County
        - Honolulu (0.04)
      - Virginia > Richmond (0.04)
  - South America > Peru (0.04)
- Genre:
  - Research Report (0.82)
- Industry:
  - Government > Regional Government
  - Leisure & Entertainment > Sports
    - Football (0.67)
  - Media > News (1.00)
- Technology: