Towards Robust Extractive Question Answering Models: Rethinking the Training Methodology
Tran, Son Quoc, Kretchmar, Matt
–arXiv.org Artificial Intelligence
This paper proposes a novel training method to improve the robustness of Extractive Question Answering (EQA) models. Previous research has shown that existing models, when trained on EQA datasets that include unanswerable questions, demonstrate a significant lack of robustness against distribution shifts and adversarial attacks. Despite this, the inclusion of unanswerable questions in EQA training datasets is essential for ensuring real-world reliability. Our proposed training method includes a novel loss function for the EQA problem and challenges an implicit assumption present in numerous EQA datasets. Models trained with our method maintain in-domain performance while achieving a notable improvement on out-of-domain datasets. This results in an overall F1 score improvement of 5.7 across all testing sets. Furthermore, our models exhibit significantly enhanced robustness against two types of adversarial attacks, with a performance decrease of only about a third compared to the default models.
arXiv.org Artificial Intelligence
Sep-29-2024
- Country:
- Asia
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Belgium > Brussels-Capital Region
- North America
- Dominican Republic (0.04)
- Mexico (0.04)
- United States
- Arizona (0.04)
- Colorado (0.05)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Nevada (0.04)
- New York > Tompkins County
- Ithaca (0.04)
- Texas (0.04)
- Washington > King County
- Seattle (0.04)
- Oceania > Australia
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Government > Military (0.56)
- Information Technology > Security & Privacy (0.70)
- Technology: