CoNT: Contrastive Neural Text Generation
–Neural Information Processing Systems
Recently, contrastive learning attracts increasing interests in neural text generation as a new solution to alleviate the exposure bias problem. It introduces a sequence-level training signal which is crucial to generation tasks that always rely on auto-regressive decoding. However, previous methods using contrastive learning in neural text generation usually lead to inferior performance. In this paper, we analyse the underlying reasons and propose a new Contrastive Neural Text generation framework, CoNT. CoNT addresses bottlenecks that prevent contrastive learning from being widely adopted in generation tasks from three aspects -- the construction of contrastive examples, the choice of the contrastive loss, and the strategy in decoding. Experimental results show that CoNT clearly outperforms its baseline on all the ten benchmarks with a convincing margin.
Neural Information Processing Systems
Oct-9-2024, 17:20:35 GMT
- Technology: