Semi-supervised Sequence Learning

Andrew M. Dai, Quoc V. Le

Neural Information Processing Systems 

We present two approaches to use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a language model in NLP. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and predicts the input sequence again. These two algorithms can be used as a "pretraining" algorithm for a later supervised sequence learning algorithm. In other words, the parameters obtained from the pretraining step can then be used as a starting point for other supervised training models. In our experiments, we find that long short-term memory recurrent networks pretrained with the two approaches become more stable to train and generalize better. With pretraining, we were able to achieve strong performance in many classification tasks, such as text classification with IMDB and DBpedia, and image recognition with CIFAR-10.
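As a rough sketch of how the two unsupervised objectives differ, the snippet below shows how input/target pairs could be constructed from an unlabeled token sequence: next-step prediction for the language model, and full reconstruction for the sequence autoencoder. The helper names are illustrative only, not from the paper.

```python
def lm_pairs(seq):
    # Language-model objective: at each position, the target is the next token.
    return list(zip(seq[:-1], seq[1:]))

def autoencoder_pairs(seq, sos="<sos>"):
    # Sequence-autoencoder objective: the encoder reads the whole sequence
    # into a vector; the decoder, started from a start-of-sequence token,
    # then reconstructs the same sequence token by token.
    encoder_input = list(seq)
    decoder_input = [sos] + list(seq[:-1])
    decoder_target = list(seq)
    return encoder_input, decoder_input, decoder_target

tokens = ["the", "movie", "was", "great"]
print(lm_pairs(tokens))
# → [('the', 'movie'), ('movie', 'was'), ('was', 'great')]
print(autoencoder_pairs(tokens))
# → (['the', 'movie', 'was', 'great'],
#    ['<sos>', 'the', 'movie', 'was'],
#    ['the', 'movie', 'was', 'great'])
```

In either case, the recurrent network trained on these pairs supplies the initial weights for the downstream supervised classifier.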