KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detection

Choi, Sehyun, Fang, Tianqing, Wang, Zhaowei, Song, Yangqiu

Oct-13-2023–arXiv.org Artificial Intelligence

Large Language Models (LLMs) have demonstrated remarkable human-level natural language generation capabilities. However, their potential to generate misinformation, often called the hallucination problem, poses a significant risk to their deployment. A common approach to address this issue is to retrieve relevant knowledge and fine-tune the LLM with the knowledge in its input. Unfortunately, this method incurs high training costs and may cause catastrophic forgetting for multi-tasking models. To overcome these limitations, we propose a knowledge-constrained decoding method called KCTS (Knowledge-Constrained Tree Search), which guides a frozen LM to generate text aligned with the reference knowledge at each decoding step using a knowledge classifier score and MCTS (Monte-Carlo Tree Search). To adapt the sequence-level knowledge classifier to token-level guidance, we also propose a novel token-level hallucination detection method called RIPA (Reward Inflection Point Approximation). Our empirical results on knowledge-grounded dialogue and abstractive summarization demonstrate the strength of KCTS as a plug-and-play, model-agnostic decoding method that can effectively reduce hallucinations in natural language generation.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

Oct-13-2023

arXiv.org PDF

Add feedback

Country:
- Asia > Middle East
  - UAE (0.14)
- Europe (1.00)
- North America > United States
  - Louisiana (0.14)
  - Michigan (0.14)
  - Mississippi (0.14)

Genre:
- Research Report (1.00)

Industry:
- Health & Medicine > Consumer Health (0.93)
- Leisure & Entertainment > Sports
  - Soccer (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Machine Learning > Neural Networks
    - Deep Learning (0.48)
  - Natural Language
    - Generation (1.00)
    - Large Language Model (1.00)
  - Representation & Reasoning > Search (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found