Trusting Your Evidence: Hallucinate Less with Context-aware Decoding

Weijia Shi, Xiaochuang Han, Mike Lewis, Yulia Tsvetkov, Luke Zettlemoyer, Scott Wen-tau Yih

arXiv.org (Artificial Intelligence)

Language models (LMs) often struggle to pay enough attention to the input context and generate text that is unfaithful to it or contains hallucinations. To mitigate this issue, we present context-aware decoding (CAD), which samples from a contrastive output distribution that amplifies the difference between the model's output probabilities with and without the context. Our experiments show that CAD, without any additional training, significantly improves faithfulness across LM families, including OPT, GPT, LLaMA, and FLAN-T5, on summarization tasks (e.g., a 14.3% gain in factuality metrics for LLaMA). Furthermore, CAD is particularly effective at overriding a model's prior knowledge when it contradicts the provided context, leading to substantial improvements on tasks where resolving the knowledge conflict is essential.
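The contrastive adjustment described above can be stated concretely: at each step, the paper reweights the next-token distribution as softmax[(1 + alpha) * logit(y_t | c, x, y_<t) - alpha * logit(y_t | x, y_<t)], where c is the context, x is the query, and alpha controls how strongly the context-conditioned distribution is amplified. The following is a minimal sketch of this decoding loop using Hugging Face transformers; the model choice, alpha value, prompt construction, and greedy decoding are illustrative assumptions, not details fixed by the abstract.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative choices: any causal LM works; "gpt2" keeps the sketch small.
model_name = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.eval()

def cad_generate(context, query, alpha=0.5, max_new_tokens=50):
    """Greedy decoding from the CAD contrastive distribution:
    (1 + alpha) * logits(y_t | c, x, y_<t) - alpha * logits(y_t | x, y_<t)."""
    with_ctx = tokenizer(context + query, return_tensors="pt").input_ids
    without_ctx = tokenizer(query, return_tensors="pt").input_ids
    generated = []
    for _ in range(max_new_tokens):
        with torch.no_grad():
            logits_ctx = model(with_ctx).logits[:, -1, :]       # conditioned on context
            logits_noctx = model(without_ctx).logits[:, -1, :]  # context ablated
        cad_logits = (1 + alpha) * logits_ctx - alpha * logits_noctx
        next_token = cad_logits.argmax(dim=-1, keepdim=True)
        if next_token.item() == tokenizer.eos_token_id:
            break
        generated.append(next_token.item())
        # Append the chosen token to both inputs so the two conditionals
        # stay aligned on the same generated prefix y_<t.
        with_ctx = torch.cat([with_ctx, next_token], dim=-1)
        without_ctx = torch.cat([without_ctx, next_token], dim=-1)
    return tokenizer.decode(generated)

Note that this requires two forward passes per token (with and without context) but no extra training, which matches the training-free framing above; alpha = 0 recovers standard decoding.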
