Syntax-Infused Transformer and BERT models for Machine Translation and Natural Language Understanding
Sundararaman, Dhanasekar, Subramanian, Vivek, Wang, Guoyin, Si, Shijing, Shen, Dinghan, Wang, Dong, Carin, Lawrence
Attention-based models have shown significant improvement over traditional algorithms in several NLP tasks. The Transformer, for instance, is an illustrative example that generates abstract representations of tokens inputted to an encoder based on their relationships to all tokens in a sequence. Recent studies have shown that although such models are capable of learning syntactic features purely by seeing examples, explicitly feeding this information to deep learning models can significantly enhance their performance. Leveraging syntactic information like part of speech (POS) may be particularly beneficial in limited training data settings for complex models such as the Transformer. We show that the syntax-infused Transformer with multiple features achieves an improvement of 0.7 BLEU when trained on the full WMT '14 English to German translation dataset and a maximum improvement of 1.99 BLEU points when trained on a fraction of the dataset. In addition, we find that the incorporation of syntax into BERT fine-tuning outperforms baseline on a number of downstream tasks from the GLUE benchmark. Introduction Attention-based deep learning models for natural language processing (NLP) have shown promise for a variety of machine translation and natural language understanding tasks. For word-level, sequence-to-sequence tasks such as translation, paraphrasing, and text summarization, attention-based models allow a single token ( e.g., a word or subword) in a sequence to be represented as a combination of all tokens in the sequence (Luong, Pham, and Manning, 2015). The distributed context allows attention-based models to infer rich representations for tokens, leading to more robust performance.
Nov-9-2019
- Country:
- Asia > China (0.04)
- North America > United States
- North Carolina > Durham County > Durham (0.04)
- Europe > Italy
- Africa
- Sub-Saharan Africa (0.04)
- Nigeria (0.04)
- Genre:
- Research Report (0.70)
- Technology: