Unsupervised Transfer Learning for Spoken Language Understanding in Intelligent Agents
Aditya Siddhant, Anuj Goyal, Angeliki Metallinou
User interaction with voice-powered agents generates large amounts of unlabeled utterances. In this paper, we explore techniques to efficiently transfer knowledge from these unlabeled utterances to improve model performance on Spoken Language Understanding (SLU) tasks. We use Embeddings from Language Models (ELMo) to take advantage of unlabeled data by learning contextualized word representations. Additionally, we propose ELMo-Light (ELMoL), a faster and simpler unsupervised pre-training method for SLU. Our findings suggest that unsupervised pre-training on a large corpus of unlabeled utterances leads to significantly better SLU performance than training from scratch, and can even outperform conventional supervised transfer. Moreover, the gains from unsupervised transfer can be further improved by combining it with supervised transfer. The improvements are more pronounced in low-resource settings: using only 1,000 labeled in-domain samples, our techniques match the performance of training from scratch on 10-15x more labeled in-domain data.
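To make the transfer recipe in the abstract concrete, here is a minimal sketch (not the authors' released code) of plugging a pretrained ELMo layer into a small SLU intent classifier, in the spirit of the paper. The checkpoint URLs and the mean-pool-plus-linear head are illustrative assumptions; the paper's actual architectures and the ELMoL variant may differ.

```python
# Sketch: pretrained ELMo embeddings feeding an intent classifier for SLU.
# Assumptions: a public AllenNLP ELMo checkpoint stands in for pre-training on
# a large corpus of unlabeled utterances; the classifier head is hypothetical.
import torch
import torch.nn as nn
from allennlp.modules.elmo import Elmo, batch_to_ids

# Small publicly released ELMo checkpoint (output dim 2 * 128 = 256).
OPTIONS = ("https://allennlp.s3.amazonaws.com/elmo/2x1024_128_2048cnn_1xhighway/"
           "elmo_2x1024_128_2048cnn_1xhighway_options.json")
WEIGHTS = ("https://allennlp.s3.amazonaws.com/elmo/2x1024_128_2048cnn_1xhighway/"
           "elmo_2x1024_128_2048cnn_1xhighway_weights.hdf5")

class IntentClassifier(nn.Module):
    """Intent classifier over contextualized ELMo token embeddings."""

    def __init__(self, num_intents: int, elmo_dim: int = 256):
        super().__init__()
        # One scalar-mixed representation per token; the mixing weights are
        # learned while fine-tuning on the labeled in-domain data.
        self.elmo = Elmo(OPTIONS, WEIGHTS, num_output_representations=1,
                         dropout=0.5)
        self.head = nn.Linear(elmo_dim, num_intents)

    def forward(self, sentences):
        char_ids = batch_to_ids(sentences)             # (batch, seq, 50)
        out = self.elmo(char_ids)
        reps = out["elmo_representations"][0]          # (batch, seq, elmo_dim)
        mask = out["mask"].unsqueeze(-1).float()       # (batch, seq, 1)
        pooled = (reps * mask).sum(1) / mask.sum(1)    # mean over real tokens
        return self.head(pooled)                       # intent logits

model = IntentClassifier(num_intents=7)
logits = model([["play", "some", "jazz"], ["set", "an", "alarm"]])
```

As described in the abstract, ELMoL would replace the full ELMo stack above with a lighter unsupervised pre-training scheme tailored to SLU; the rest of the pipeline (unlabeled pre-training, then fine-tuning on a small labeled in-domain set) stays the same.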
arXiv.org Artificial Intelligence
Nov-13-2018
- Genre:
- Research Report
- Experimental Study (0.68)
- New Finding (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Neural Networks > Deep Learning (1.00)
- Transfer Learning (0.86)
- Natural Language (1.00)
- Representation & Reasoning > Agents (1.00)
- Speech > Speech Recognition (1.00)