Penalty Decoding: Well Suppress the Self-Reinforcement Effect in Open-Ended Text Generation

Oct-23-2023–arXiv.org Artificial Intelligence

The decoding algorithm is critical for open-ended text generation, transforming latent representations into coherent and meaningful outputs. This paper investigates the self-reinforcement effect in text generation and the effectiveness of a repetition penalty to mitigate it. However, determining the optimal repetition penalty value is challenging. To tackle this, we propose a forgetting mechanism that disregards distant tokens, reducing the burden of penalty selection. In addition, we introduce a length penalty to address overly short sentences caused by excessive penalties. Our penalty decoding approach incorporating three strategies helps resolve issues with sampling methods deviating from factual information. Experimental results demonstrate the efficacy of our approach in generating high-quality sentences resembling human output.

penalty, repetition penalty, text generation, (13 more...)

arXiv.org Artificial Intelligence

Oct-23-2023

arXiv.org PDF

Add feedback

Country:
- South America
  - Bolivia (0.04)
  - Argentina (0.04)
- Oceania > Australia
  - Victoria > Melbourne (0.04)
- North America > United States
  - Hawaii > Honolulu County
    - Honolulu (0.04)
  - California
    - San Diego County > San Diego (0.04)
    - Los Angeles County > Long Beach (0.04)
- Europe
  - Austria (0.04)
  - United Kingdom > England
    - Cambridgeshire > Cambridge (0.04)
- Asia
  - Middle East > UAE
    - Abu Dhabi Emirate > Abu Dhabi (0.04)
  - China > Shanghai
    - Shanghai (0.05)
- Africa > Ethiopia
  - Addis Ababa > Addis Ababa (0.04)

Genre:
- Research Report > New Finding (0.48)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Large Language Model (0.97)
  - Machine Learning > Neural Networks
    - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found