Explain and Predict, and then Predict again

Zhang, Zijian, Rudra, Koustav, Anand, Avishek

Jan-11-2021–arXiv.org Artificial Intelligence

A desirable property of learning systems is to be both effective and interpretable. Towards this goal, recent models have been proposed that first generate an extractive explanation from the input text and then generate a prediction on just the explanation called explain-then-predict models. These models primarily consider the task input as a supervision signal in learning an extractive explanation and do not effectively integrate rationales data as an additional inductive bias to improve task performance. We propose a novel yet simple approach ExPred, that uses multi-task learning in the explanation generation phase effectively trading-off explanation and prediction losses. And then we use another prediction network on just the extracted explanations for optimizing the task performance. We conduct an extensive evaluation of our approach on three diverse language datasets -- fact verification, sentiment classification, and QA -- and find that we substantially outperform existing approaches.

explanation, prediction, rationale, (14 more...)

arXiv.org Artificial Intelligence

Jan-11-2021

arXiv.org PDF

Add feedback

Country:
- Pacific Ocean > North Pacific Ocean
  - San Francisco Bay (0.04)
- North America > United States
  - New York > New York County
    - New York City (0.04)
  - California
    - San Francisco County > San Francisco (0.04)
    - San Diego County > San Diego (0.04)
    - Los Angeles County > Los Angeles (0.04)
- Europe > Germany
  - Lower Saxony > Hanover (0.04)
- Asia > Middle East
  - Israel (0.04)

Genre:
- Research Report (1.00)

Industry:
- Media (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning (1.00)
  - Natural Language > Text Classification (0.66)
  - Machine Learning > Neural Networks
    - Deep Learning (0.69)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found