Self-Evaluation Guided Beam Search for Reasoning

May-25-2025, 03:13:20 GMT–Neural Information Processing Systems

Breaking down a problem into intermediate steps has demonstrated impressive performance in Large Language Model (LLM) reasoning. However, the growth of the reasoning chain introduces uncertainty and error accumulation, making it challenging to elicit accurate final results. To tackle this challenge of uncertainty in multi-step reasoning, we introduce a stepwise self-evaluation mechanism to guide and calibrate the reasoning process of LLMs. We propose a decoding algorithm integrating the self-evaluation guidance via stochastic beam search. The selfevaluation guidance serves as a better-calibrated automatic criterion, facilitating an efficient search in the reasoning space and resulting in superior prediction quality.

large language model, machine learning, natural language, (20 more...)

Neural Information Processing Systems

May-25-2025, 03:13:20 GMT

Conferences PDF

Add feedback

Country:
- Asia (0.67)
- North America > United States
  - California (0.14)
  - Hawaii (0.14)
  - Minnesota > Hennepin County
    - Minneapolis (0.14)

Genre:
- Workflow (0.46)

Industry:
- Energy > Oil & Gas (0.68)
- Leisure & Entertainment (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Cognitive Science > Problem Solving (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.70)
  - Natural Language > Large Language Model (1.00)
  - Representation & Reasoning (1.00)

Duplicate Docs Excel Report

Title
Self-Evaluation Guided Beam Search for Reasoning Min-Yen Kan

Similar Docs Excel Report more

Title	Similarity	Source
None found