Optimus: Organizing Sentences via Pre-trained Modeling of a Latent Space

Li, Chunyuan, Gao, Xiang, Li, Yuan, Li, Xiujun, Peng, Baolin, Zhang, Yizhe, Gao, Jianfeng

Apr-5-2020–arXiv.org Machine Learning

When trained effectively, the Variational Autoencoder (VAE) can be both a powerful generative model and an effective representation learning framework for natural language. In this paper, we propose the first large-scale language VAE model, Optimus. A universal latent embedding space for sentences is first pre-trained on large text corpus, and then fine-tuned for various language generation and understanding tasks. Compared with GPT-2, Optimus enables guided language generation from an abstract level using the latent vectors. Compared with BERT, Optimus can generalize better on low-resource language understanding tasks due to the smooth latent space structure. Extensive experimental results on a wide range of language tasks demonstrate the effectiveness of Optimus. It achieves new state-of-the-art on VAE language modeling benchmarks. We hope that our first pre-trained big VAE language model itself and results can help the NLP community renew the interests of deep generative models in the era of large-scale pre-training, and make these principled methods more practical.

latent space, latexit latexit sha1, latexit sha1, (16 more...)

arXiv.org Machine Learning

Apr-5-2020

arXiv.org PDF

Add feedback

Country:
- Europe > Italy > Calabria > Catanzaro Province > Catanzaro (0.04)

Genre:
- Research Report (0.50)

Industry:
- Leisure & Entertainment > Sports (1.00)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language
    - Large Language Model (1.00)
    - Generation (1.00)
    - Chatbot (1.00)
  - Machine Learning > Neural Networks
    - Deep Learning > Generative AI (0.34)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found