Think&Cite: Improving Attributed Text Generation with Self-Guided Tree Search and Progress Reward Modeling

Dec-19-2024–arXiv.org Artificial Intelligence

Despite their outstanding capabilities, large language models (LLMs) are prone to hallucination and producing factually incorrect information. This challenge has spurred efforts in attributed text generation, which prompts LLMs to generate content with supporting evidence. In this paper, we propose a novel framework, called Think&Cite, and formulate attributed text generation as a multi-step reasoning problem integrated with search. Specifically, we propose Self-Guided Monte Carlo Tree Search (SG-MCTS), which capitalizes on the self-reflection capability of LLMs to reflect on the intermediate states of MCTS for guiding the tree expansion process. To provide reliable and comprehensive feedback, we introduce Progress Reward Models to measure the progress of tree search from the root to the current state from two aspects, i.e., generation and attribution progress. We conduct extensive experiments on three datasets and the results show that our approach significantly outperforms baseline approaches.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

Dec-19-2024

arXiv.org PDF

Add feedback

Country:
- South America > Colombia (0.04)
- North America
  - United States
    - Texas (0.14)
    - Rocky Mountains (0.04)
    - Colorado > Gunnison County (0.04)
    - Washington > King County
      - Seattle (0.04)
    - Louisiana > Orleans Parish
      - New Orleans (0.04)
    - Florida > Miami-Dade County
      - Miami (0.04)
  - Canada
    - Rocky Mountains (0.04)
    - Ontario > Toronto (0.04)
- Europe
  - Austria > Vienna (0.14)
  - Germany > Berlin (0.04)
  - Italy > Tuscany
    - Florence (0.04)
- Asia
  - Singapore (0.04)
  - India (0.04)
  - Thailand > Bangkok
    - Bangkok (0.04)
  - Myanmar > Tanintharyi Region
    - Dawei (0.04)
  - Middle East
    - Jordan (0.04)
    - UAE > Abu Dhabi Emirate
      - Abu Dhabi (0.04)

Genre:
- Research Report (0.84)

Industry:
- Health & Medicine (0.68)
- Leisure & Entertainment
  - Sports > Football (0.94)
  - Games (0.67)

Technology:
- Information Technology > Artificial Intelligence
  - Representation & Reasoning > Search (1.00)
  - Natural Language > Large Language Model (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning (0.46)