DiffuSeq: Sequence to Sequence Text Generation with Diffusion Models

Gong, Shansan, Li, Mukai, Feng, Jiangtao, Wu, Zhiyong, Kong, Lingpeng

Feb-14-2023–arXiv.org Artificial Intelligence

Recently, diffusion models have emerged as a new paradigm for generative models. Despite their success in domains with continuous signals such as vision and audio, adapting diffusion models to natural language remains under-explored due to the discrete nature of text, especially for conditional generation. We tackle this challenge by proposing DiffuSeq: a diffusion model designed for sequence-to-sequence (Seq2Seq) text generation tasks. Upon extensive evaluation over a wide range of Seq2Seq tasks, we find that DiffuSeq achieves comparable or even better performance than six established baselines, including a state-of-the-art model that is based on pre-trained language models. Apart from quality, an intriguing property of DiffuSeq is its high diversity during generation, which is desired in many Seq2Seq tasks. We further include a theoretical analysis revealing the connection between DiffuSeq and autoregressive/non-autoregressive models. Bringing together theoretical analysis and empirical evidence, we demonstrate the great potential of diffusion models in complex conditional language generation tasks. Code is available at https://github.com/Shark-NLP/DiffuSeq

diffusion model, machine learning, natural language

Country:
- Europe > Italy
  - Calabria > Catanzaro Province > Catanzaro (0.04)
- Asia
  - Japan (0.05)
  - China
    - Shanghai > Shanghai (0.04)
    - Hong Kong (0.04)

Genre:
- Research Report
  - Promising Solution (0.48)
  - New Finding (0.46)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Generation (0.68)
  - Representation & Reasoning > Uncertainty (0.68)
  - Machine Learning > Neural Networks
    - Deep Learning (0.95)
