Faithful Reasoning Using Large Language Models
Creswell, Antonia, Shanahan, Murray
–arXiv.org Artificial Intelligence
Although contemporary large language models (LMs) demonstrate impressive question-answering capabilities, their answers are typically the product of a single call to the model. This entails an unwelcome degree of opacity and compromises performance, especially on problems that are inherently multi-step. To address these limitations, we show how LMs can be made to perform faithful multi-step reasoning via a process whose causal structure mirrors the underlying logical structure of the problem. Our approach works by chaining together reasoning steps, where each step results from calls to two fine-tuned LMs, one for selection and one for inference, to produce a valid reasoning trace. Our method carries out a beam search through the space of reasoning traces to improve reasoning quality. We demonstrate the effectiveness of our model on multi-step logical deduction and scientific question-answering, showing that it outperforms baselines on final answer accuracy, and generates humanly interpretable reasoning traces whose validity can be checked by the user.
arXiv.org Artificial Intelligence
Aug-30-2022
- Country:
- North America
- Dominican Republic (0.04)
- United States
- Maryland (0.04)
- New York > New York County
- New York City (0.04)
- Europe
- Netherlands (0.04)
- Ireland (0.04)
- North America
- Genre:
- Research Report (0.81)
- Industry:
- Materials > Metals & Mining (1.00)
- Transportation (0.93)
- Energy > Renewable
- Ocean Energy (0.69)
- Education > Curriculum
- Subject-Specific Education (0.45)
- Technology: