WIQA: A dataset for "What if..." reasoning over procedural text

Tandon, Niket, Mishra, Bhavana Dalvi, Sakaguchi, Keisuke, Bosselut, Antoine, Clark, Peter

Sep-10-2019–arXiv.org Artificial Intelligence

We introduce WIQA, the first large-scale dataset of "What if..." questions over procedural text. WIQA contains three parts: a collection of paragraphs each describing a process, e.g., beach erosion; a set of crowdsourced influence graphs for each paragraph, describing how one change a ffects another; and a large (40k) collection of "What if...?" multiple-choice questions derived from the graphs. For example, given a paragraph about beach erosion, would stormy weather result in more or less erosion (or have no e ff ect)? The task is to answer the questions, given their associated paragraph. WIQA contains three kinds of questions: perturbations to steps mentioned in the paragraph; external (out-of-paragraph) perturbations requiring commonsense knowledge; and irrelevant (no e ff ect) perturbations. We find that state-of-the-art models achieve 73.8% accuracy, well below the human performance of 96.3%. We analyze the challenges, in particular tracking chains of influences, and present the dataset as an open challenge to the community.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Sep-10-2019

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.84)

Industry:
- Education (0.48)

Technology:
- Information Technology
  - Communications > Social Media
    - Crowdsourcing (0.89)
  - Artificial Intelligence
    - Natural Language (1.00)
    - Machine Learning (1.00)
    - Representation & Reasoning > Qualitative Reasoning (0.68)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found