Query Refinement Prompts for Closed-Book Long-Form Question Answering
Amplayo, Reinald Kim, Webster, Kellie, Collins, Michael, Das, Dipanjan, Narayan, Shashi
–arXiv.org Artificial Intelligence
Large language models (LLMs) have been shown to perform well in answering questions and in producing long-form texts, both in few-shot closed-book settings. While the former can be validated using well-known evaluation metrics, the latter is difficult to evaluate. We resolve the difficulties to evaluate long-form output by doing both tasks at once -- to do question answering that requires long-form answers. Such questions tend to be multifaceted, i.e., they may have ambiguities and/or require information from multiple sources. To this end, we define query refinement prompts that encourage LLMs to explicitly express the multifacetedness in questions and generate long-form answers covering multiple facets of the question. Our experiments on two long-form question answering datasets, ASQA and AQuAMuSe, show that using our prompts allows us to outperform fully finetuned models in the closed book setting, as well as achieve results comparable to retrieve-then-generate open-book models.
arXiv.org Artificial Intelligence
Oct-31-2022
- Country:
- Asia
- China > Hong Kong (0.04)
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Middle East > Jordan (0.04)
- North Korea (0.04)
- South Korea (0.04)
- Thailand > Bangkok
- Bangkok (0.04)
- Europe
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Italy > Tuscany
- Florence (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- United Kingdom > England
- West Yorkshire > Bradford (0.04)
- Belgium > Brussels-Capital Region
- North America
- Canada > British Columbia
- Dominican Republic (0.04)
- United States
- Texas > Travis County
- Austin (0.04)
- Washington > King County
- Seattle (0.04)
- Wisconsin > Racine County
- Racine (0.04)
- Texas > Travis County
- Oceania > Australia
- South America > Chile
- Asia
- Genre:
- Research Report (0.50)
- Industry:
- Leisure & Entertainment (1.00)
- Media (1.00)
- Technology: