Chain-of-Questions Training with Latent Answers for Robust Multistep Question Answering
Zhu, Wang, Thomason, Jesse, Jia, Robin
–arXiv.org Artificial Intelligence
We train a language model (LM) to robustly answer multistep questions by generating and answering sub-questions. We propose Chain-of-Questions, a framework that trains a model to generate sub-questions and sub-answers one at a time by leveraging human annotated question decomposition meaning representation (QDMR). The key technical challenge is that QDMR only contains sub-questions but not answers to those sub-questions, so we treat sub-answers as latent variables and optimize them using a novel dynamic mixture of Hard-EM and MAPO. Chain-of-Questions greatly outperforms strong neuro-symbolic methods by 9.0 F1 on DROP contrast set, and outperforms GPT-3.5 by 24.3 F1 on HOTPOTQA adversarial set, thus demonstrating the effectiveness and robustness of our framework.
arXiv.org Artificial Intelligence
Dec-23-2023
- Country:
- Africa > Togo (0.05)
- South America > Chile
- Oceania > Australia
- South Australia > Adelaide (0.04)
- North America
- Dominican Republic (0.04)
- Canada > Rocky Mountains (0.04)
- United States
- Montana (0.04)
- Rocky Mountains (0.04)
- Washington > King County
- Seattle (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.04)
- California > Los Angeles County
- Los Angeles (0.28)
- Europe
- United Kingdom > England (0.04)
- Italy > Tuscany
- Florence (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- China > Hong Kong (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Genre:
- Research Report (0.64)
- Industry:
- Government (0.47)
- Leisure & Entertainment (0.46)
- Technology: