Reasoning Like Program Executors

Pi, Xinyu, Liu, Qian, Chen, Bei, Ziyadi, Morteza, Lin, Zeqi, Gao, Yan, Fu, Qiang, Lou, Jian-Guang, Chen, Weizhu

Jan-27-2022–arXiv.org Artificial Intelligence

Reasoning over natural language is a long-standing goal for the research community. However, studies have shown that existing language models are inadequate in reasoning. To address the issue, we present POET, a new pre-training paradigm. Through pre-training language models with programs and their execution results, POET empowers language models to harvest the reasoning knowledge possessed in program executors via a data-driven approach. POET is conceptually simple and can be instantiated by different kinds of programs. In this paper, we show three empirically powerful instances, i.e., POET-Math, POET-Logic, and POET-SQL. Experimental results on six benchmarks demonstrate that POET can significantly boost model performance on natural language reasoning, such as numerical reasoning, logical reasoning, and multi-hop reasoning. Taking the DROP benchmark as a representative example, POET improves the F1 metric of BART from 69.2% to 80.6%. Furthermore, POET shines in giant language models, pushing the F1 metric of T5-11B to 87.6% and achieving a new state-of-the-art performance on DROP. POET opens a new gate on reasoning-enhancement pre-training and we hope our analysis would shed light on the future research of reasoning like program executors.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Jan-27-2022

arXiv.org PDF

Add feedback

Country:
- Asia (0.68)
- North America > United States
  - Minnesota > Hennepin County > Minneapolis (0.14)

Genre:
- Research Report (1.00)

Industry:
- Education (0.47)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Problem Solving (0.46)
  - Machine Learning (1.00)
  - Natural Language (1.00)