Efficient Open Domain Multi-Hop Question Answering with Few-Shot Data Synthesis
Chen, Mingda, Chen, Xilun, Yih, Wen-tau
–arXiv.org Artificial Intelligence
Few-shot learning for open domain multi-hop question answering typically relies on large language models (LLMs). While powerful, LLMs are inefficient at the inference time. We propose a data synthesis framework for multi-hop question answering that allows for improving smaller language models with less than 10 human-annotated question answer pairs. The framework is built upon the data generation functions parameterized by LLMs and prompts, which requires minimal hand-crafted features. Empirically, we synthesize millions of multi-hop questions and claims. After finetuning language models on the synthetic data, we evaluate the models on popular benchmarks on multi-hop question answering and fact verification. Our experimental results show that finetuning on the synthetic data improves model performance significantly, allowing our finetuned models to be competitive with prior models while being almost one-third the size in terms of parameter counts.
arXiv.org Artificial Intelligence
May-23-2023
- Country:
- Oceania > New Zealand (0.04)
- South America > Peru (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Colorado (0.07)
- Pennsylvania (0.04)
- Texas > Travis County
- Austin (0.04)
- Indiana
- Monroe County > Bloomington (0.04)
- Marion County > Indianapolis (0.04)
- South Carolina > Charleston County
- Charleston (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Illinois > Cook County
- Chicago (0.04)
- California > Los Angeles County
- Los Angeles (0.04)
- Washington > King County
- Seattle (0.04)
- New York > New York County
- New York City (0.04)
- Europe
- United Kingdom > England
- Merseyside > Liverpool (0.04)
- Greater London > London (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Netherlands > North Holland
- Amsterdam (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- United Kingdom > England
- Asia
- China > Hong Kong (0.04)
- Middle East > UAE
- Abu Dhabi Emirate > Abu Dhabi (0.04)
- Genre:
- Research Report > New Finding (0.88)
- Industry:
- Media
- Television (1.00)
- Film (1.00)
- Leisure & Entertainment > Sports
- Basketball (0.69)
- Soccer (0.46)
- Media
- Technology: