WIQA: A dataset for "What if..." reasoning over procedural text
Tandon, Niket, Mishra, Bhavana Dalvi, Sakaguchi, Keisuke, Bosselut, Antoine, Clark, Peter
–arXiv.org Artificial Intelligence
We introduce WIQA, the first large-scale dataset of "What if..." questions over procedural text. WIQA contains three parts: a collection of paragraphs each describing a process, e.g., beach erosion; a set of crowdsourced influence graphs for each paragraph, describing how one change a ffects another; and a large (40k) collection of "What if...?" multiple-choice questions derived from the graphs. For example, given a paragraph about beach erosion, would stormy weather result in more or less erosion (or have no e ff ect)? The task is to answer the questions, given their associated paragraph. WIQA contains three kinds of questions: perturbations to steps mentioned in the paragraph; external (out-of-paragraph) perturbations requiring commonsense knowledge; and irrelevant (no e ff ect) perturbations. We find that state-of-the-art models achieve 73.8% accuracy, well below the human performance of 96.3%. We analyze the challenges, in particular tracking chains of influences, and present the dataset as an open challenge to the community.
arXiv.org Artificial Intelligence
Sep-10-2019
- Country:
- North America > United States > Texas > Travis County > Austin (0.04)
- Genre:
- Research Report (0.84)
- Industry:
- Education (0.48)
- Technology: