Teaching Smaller Language Models To Generalise To Unseen Compositional Questions
Hartill, Tim, Tan, Neset, Witbrock, Michael, Riddle, Patricia J.
–arXiv.org Artificial Intelligence
We equip a smaller Language Model to generalise to answering challenging compositional questions that have not been seen in training. To do so we propose a combination of multitask supervised pretraining on up to 93 tasks designed to instill diverse reasoning abilities, and a dense retrieval system that aims to retrieve a set of evidential paragraph fragments. Recent progress in question-answering has been achieved either through prompting methods against very large pretrained Language Models in zero or few-shot fashion, or by fine-tuning smaller models, sometimes in conjunction with information retrieval. We focus on the less explored question of the extent to which zero-shot generalisation can be enabled in smaller models with retrieval against a corpus within which sufficient information to answer a particular question may not exist. We establish strong baselines in this setting for diverse evaluation datasets (StrategyQA, CommonsenseQA, IIRC, DROP, Musique and ARC-DA), and show that performance can be significantly improved by adding retrieval-augmented training datasets which are designed to expose our models to a variety of heuristic reasoning strategies such as weighing partial evidence or ignoring an irrelevant context.
arXiv.org Artificial Intelligence
Aug-20-2023
- Country:
- Oceania
- New Zealand > North Island
- Auckland Region > Auckland (0.04)
- Australia > Victoria
- Melbourne (0.04)
- New Zealand > North Island
- North America
- Dominican Republic (0.04)
- United States
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Washington > King County
- Canada > British Columbia
- Europe
- Hungary (0.04)
- Austria (0.04)
- Italy > Tuscany
- Florence (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- France > Provence-Alpes-Côte d'Azur
- Bouches-du-Rhône > Marseille (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia > China
- Hong Kong (0.04)
- Oceania
- Genre:
- Research Report
- New Finding (0.67)
- Experimental Study (0.46)
- Research Report
- Industry:
- Education (0.46)
- Leisure & Entertainment (0.46)
- Technology: