Discrepancy Detection at the Data Level: Toward Consistent Multilingual Question Answering
Calvo-Bartolomé, Lorena, Aldana, Valérie, Cantarero, Karla, de Mesa, Alonso Madroñal, Arenas-García, Jerónimo, Boyd-Graber, Jordan
–arXiv.org Artificial Intelligence
Multilingual question answering (QA) systems must ensure factual consistency across languages, especially for objective queries such as What is jaundice?, while also accounting for cultural variation in subjective responses. We propose MIND, a user-in-the-loop fact-checking pipeline to detect factual and cultural discrepancies in multilingual QA knowledge bases. MIND highlights divergent answers to culturally sensitive questions (e.g., Who assists in childbirth?) that vary by region and context. We evaluate MIND on a bilingual QA system in the maternal and infant health domain and release a dataset of bilingual questions annotated for factual and cultural inconsistencies. We further test MIND on datasets from other domains to assess generalization. In all cases, MIND reliably identifies inconsistencies, supporting the development of more culturally aware and factually consistent QA systems.
arXiv.org Artificial Intelligence
Oct-15-2025
- Country:
- Africa (0.04)
- Asia
- Europe
- Austria > Vienna (0.14)
- France (0.14)
- Germany (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Middle East > Malta (0.04)
- Spain > Galicia
- Madrid (0.04)
- United Kingdom > England (0.04)
- North America
- Canada
- Quebec > Capitale-Nationale Region
- Quebec City (0.04)
- Québec (0.04)
- Rocky Mountains (0.04)
- Quebec > Capitale-Nationale Region
- Mexico > Mexico City
- Mexico City (0.04)
- United States
- California (0.04)
- Massachusetts (0.04)
- Mississippi (0.04)
- Virginia (0.04)
- Florida > Pinellas County
- St. Petersburg (0.04)
- New Mexico > Bernalillo County
- Albuquerque (0.04)
- Rocky Mountains (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Maryland > Prince George's County
- College Park (0.04)
- Ohio > Franklin County
- Columbus (0.04)
- Canada
- Genre:
- Research Report > Experimental Study (1.00)
- Industry:
- Government > Regional Government
- Health & Medicine
- Consumer Health (1.00)
- Epidemiology (1.00)
- Pharmaceuticals & Biotechnology (1.00)
- Public Health (1.00)
- Therapeutic Area
- Cardiology/Vascular Diseases (1.00)
- Immunology (1.00)
- Infections and Infectious Diseases (1.00)
- Neurology (1.00)
- Obstetrics/Gynecology (1.00)
- Pediatrics/Neonatology (1.00)
- Technology: