RobuT: A Systematic Study of Table QA Robustness Against Human-Annotated Adversarial Perturbations
Zhao, Yilun, Zhao, Chen, Nan, Linyong, Qi, Zhenting, Zhang, Wenlin, Tang, Xiangru, Mi, Boyu, Radev, Dragomir
–arXiv.org Artificial Intelligence
Despite significant progress having been made in question answering on tabular data (Table QA), it's unclear whether, and to what extent existing Table QA models are robust to task-specific perturbations, e.g., replacing key question entities or shuffling table columns. To systematically study the robustness of Table QA models, we propose a benchmark called RobuT, which builds upon existing Table QA datasets (WTQ, WikiSQL-Weak, and SQA) and includes human-annotated adversarial perturbations in terms of table header, table content, and question. Our results indicate that both state-of-the-art Table QA models and large language models (e.g., GPT-3) with few-shot learning falter in these adversarial sets. We propose to address this problem by using large language models to generate adversarial examples to enhance training, which significantly improves the robustness of Table QA models. Our data and code is publicly available at https://github.com/yilunzhao/RobuT.
arXiv.org Artificial Intelligence
Jun-25-2023
- Country:
- Africa > Middle East
- Egypt > Cairo Governorate > Cairo (0.04)
- Asia
- China > Beijing
- Beijing (0.04)
- Japan
- Honshū > Kantō
- Tokyo Metropolis Prefecture > Tokyo (0.14)
- Kyūshū & Okinawa > Kyūshū
- Fukuoka Prefecture > Fukuoka (0.04)
- Honshū > Kantō
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- South Korea > Seoul
- Seoul (0.04)
- China > Beijing
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Finland > Uusimaa
- Helsinki (0.04)
- France (0.04)
- Germany > Baden-Württemberg
- Stuttgart Region > Stuttgart (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Italy
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Sweden > Vaestra Goetaland
- Gothenburg (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > British Columbia
- Dominican Republic (0.04)
- United States
- California > Los Angeles County
- Los Angeles (0.14)
- Kansas (0.04)
- New York (0.04)
- Washington > King County
- Seattle (0.04)
- California > Los Angeles County
- Oceania > Australia
- Queensland > Brisbane (0.04)
- Victoria > Melbourne (0.04)
- South America
- Argentina (0.04)
- Chile > Santiago Metropolitan Region
- Santiago Province > Santiago (0.04)
- Venezuela
- Capital District > Caracas (0.04)
- Táchira State > San Cristóbal (0.04)
- Zulia State > Maracaibo (0.04)
- Africa > Middle East
- Genre:
- Research Report > New Finding (0.88)
- Industry:
- Leisure & Entertainment > Sports (1.00)
- Technology: