Unsupervised Learning of Sentence Representations Using Sequence Consistency

Sep-29-2018–arXiv.org Artificial Intelligence

Computing universal distributed representations of sentences is a fundamental task in natural language processing. We propose a simple, yet surprisingly powerful unsupervised method to learn such representations by enforcing consistency constraints on sequences of tokens. We consider two classes of such constraints - sequences that form a sentence and between two sequences that form a sentence when merged. We learn a sentence encoder by training it to distinguish between consistent and inconsistent examples. Extensive evaluation on several transfer learning and linguistic probing tasks shows improved performance over strong unsupervised and supervised baselines, substantially surpassing them in several cases. In natural language processing, the use of distributed representations has become standard through the effective use of word embeddings.

artificial intelligence, machine learning, natural language, (22 more...)

arXiv.org Artificial Intelligence

Sep-29-2018

arXiv.org PDF

Add feedback

Genre:
- Research Report (0.51)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language > Text Processing (0.84)
  - Machine Learning
    - Neural Networks (0.95)
    - Inductive Learning (0.94)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found