CONCRETE: Improving Cross-lingual Fact-checking with Cross-lingual Retrieval
Huang, Kung-Hsiang, Zhai, ChengXiang, Ji, Heng
arXiv.org Artificial Intelligence
Fact-checking has gained increasing attention as falsified information has become widespread. Most fact-checking approaches focus only on claims made in English because data in other languages is scarce. The lack of fact-checking datasets in low-resource languages calls for an effective cross-lingual transfer technique for fact-checking. Additionally, trustworthy information in different languages can be complementary and helpful in verifying facts. To this end, we present the first fact-checking framework augmented with cross-lingual retrieval, which aggregates evidence retrieved from multiple languages through a cross-lingual retriever. Given the absence of cross-lingual information retrieval datasets with claim-like queries, we train the retriever with our proposed Cross-lingual Inverse Cloze Task (X-ICT), a self-supervised algorithm that creates training instances by translating the title of a passage. X-ICT trains the model for cross-lingual retrieval by having it identify the passage that corresponds to a given translated title. On the X-Fact dataset, our approach achieves an absolute F1 improvement of 2.23% in the zero-shot cross-lingual setup over prior systems. The source code and data are publicly available at https://github.com/khuangaf/CONCRETE.
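As a rough illustration of how X-ICT training instances might be assembled, the sketch below pairs each passage with its title translated into other languages. The abstract states only that instances are created by translating a passage's title. The `Passage` type, the `build_xict_instances` helper, the choice of target languages, and the `toy_translate` stand-in for a real machine translation system are all hypothetical and are not taken from the CONCRETE code base.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence, Tuple


@dataclass(frozen=True)
class Passage:
    """Hypothetical container for a titled passage in a given language."""
    title: str
    text: str
    lang: str


def build_xict_instances(
    passages: Sequence[Passage],
    translate: Callable[[str, str, str], str],
    target_langs: Sequence[str],
) -> List[Tuple[str, int]]:
    """Return (translated_title, positive_passage_index) pairs.

    Each translated title acts as a pseudo-query whose positive passage is
    the one the title came from. Assumption: the other passages in a
    training batch would serve as negatives.
    """
    instances: List[Tuple[str, int]] = []
    for idx, passage in enumerate(passages):
        for lang in target_langs:
            if lang == passage.lang:
                continue  # only cross-lingual pairs are kept
            query = translate(passage.title, passage.lang, lang)
            instances.append((query, idx))
    return instances


def toy_translate(text: str, src: str, tgt: str) -> str:
    """Placeholder for a real translation system (assumption)."""
    return f"[{src}->{tgt}] {text}"


passages = [
    Passage("Eiffel Tower", "The Eiffel Tower is a lattice tower in Paris.", "en"),
    Passage("Tour Eiffel", "La tour Eiffel est une tour de fer à Paris.", "fr"),
]
instances = build_xict_instances(passages, toy_translate, ["en", "fr"])
```

A retriever trained on such pairs would be asked to score the positive passage above the others for each translated title, which matches the objective described in the abstract only at this high level.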
Sep-5-2022
- Country:
  - North America
    - Haiti (0.04)
    - United States
      - Washington > King County
        - Seattle (0.04)
      - New York > New York County
        - New York City (0.04)
      - Minnesota > Hennepin County
        - Minneapolis (0.14)
      - Louisiana > Orleans Parish
        - New Orleans (0.04)
      - Illinois > Champaign County
        - Urbana (0.04)
    - Canada > British Columbia
  - Europe
    - Sweden (0.14)
    - United Kingdom (0.04)
    - Italy (0.04)
    - France > Provence-Alpes-Côte d'Azur
      - Bouches-du-Rhône > Marseille (0.04)
    - Finland > Uusimaa
      - Helsinki (0.04)
    - Belgium > Brussels-Capital Region
      - Brussels (0.04)
  - Asia
    - South Korea (0.04)
    - Singapore (0.04)
    - Malaysia (0.04)
    - China > Hubei Province
      - Wuhan (0.04)
- Genre:
  - Research Report (1.00)
- Industry:
  - Media > News (0.94)
  - Government (0.94)
  - Automobiles & Trucks (0.69)
  - Transportation > Ground
    - Road (0.47)
- Technology: