Compositional Questions Do Not Necessitate Multi-hop Reasoning
Min, Sewon, Wallace, Eric, Singh, Sameer, Gardner, Matt, Hajishirzi, Hannaneh, Zettlemoyer, Luke
–arXiv.org Artificial Intelligence
Multi-hop reading comprehension (RC) questions are challenging because they require reading and reasoning over multiple paragraphs. We argue that it can be difficult to construct large multi-hop RC datasets. For example, even highly compositional questions can be answered with a single hop if they target specific entity types, or the facts needed to answer them are redundant. Our analysis is centered on HotpotQA, where we show that single-hop reasoning can solve much more of the dataset than previously thought. We introduce a single-hop BERT-based RC model that achieves 67 F1---comparable to state-of-the-art multi-hop models. We also design an evaluation setting where humans are not shown all of the necessary paragraphs for the intended multi-hop reasoning but can still answer over 80% of questions. Together with detailed error analysis, these results suggest there should be an increasing focus on the role of evidence in multi-hop reasoning and possibly even a shift towards information retrieval style evaluations with large and diverse evidence collections.
arXiv.org Artificial Intelligence
Jun-7-2019
- Country:
- Africa > Democratic Republic of the Congo (0.04)
- Europe
- France
- Hauts-de-France > Pas-de-Calais (0.04)
- Provence-Alpes-Côte d'Azur > Alpes-de-Haute-Provence (0.04)
- United Kingdom > England (0.04)
- France
- North America
- Guadeloupe (0.05)
- Saint Barthélemy (0.05)
- Saint Martin (0.04)
- United States
- California > Orange County
- Irvine (0.04)
- Louisiana > Avoyelles Parish (0.04)
- Missouri > Jackson County
- Kansas City (0.04)
- California > Orange County
- Genre:
- Research Report > New Finding (0.34)
- Industry:
- Education > Assessment & Standards > Student Performance (0.35)
- Technology: