Fast Structured Decoding for Sequence Models

Sun, Zhiqing, Li, Zhuohan, Wang, Haoqing, He, Di, Lin, Zi, Deng, Zhihong

Mar-18-2020, 21:33:23 GMT – Neural Information Processing Systems

Autoregressive sequence models achieve state-of-the-art performance in domains such as machine translation. However, because they factorize the output distribution autoregressively and generate tokens one at a time, these models suffer from high latency during inference. Non-autoregressive sequence models were recently proposed to reduce inference time. However, these models assume that each token is decoded conditionally independently of the others. Such a generation process sometimes makes the output sentence inconsistent, so the learned non-autoregressive models achieve lower accuracy than their autoregressive counterparts.
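To make the independence assumption concrete, here is a minimal sketch on a toy problem. It is not taken from the paper: the two-sentence target distribution and all function names are hypothetical, chosen only for illustration. The sketch compares a chain-rule (autoregressive) factorization, p(y1) p(y2 | y1), with a fully independent factorization, p(y1) p(y2), and computes both exactly rather than by sampling.

```python
from collections import defaultdict
from itertools import product

# Hypothetical toy target: two equally likely two-token outputs (an assumption).
TARGET = {("thank", "you"): 0.5, ("many", "thanks"): 0.5}


def marginals(joint):
    """Return the per-position marginal distributions p(y_t)."""
    length = len(next(iter(joint)))
    margs = [defaultdict(float) for _ in range(length)]
    for seq, prob in joint.items():
        for t, token in enumerate(seq):
            margs[t][token] += prob
    return [dict(m) for m in margs]


def autoregressive_joint(joint):
    """Chain rule for length-2 sequences: p(y1) * p(y2 | y1)."""
    first = marginals(joint)[0]
    result = {}
    for (y1, y2), prob in joint.items():
        conditional = prob / first[y1]  # p(y2 | y1)
        result[(y1, y2)] = first[y1] * conditional
    return result


def independent_joint(joint):
    """Conditionally independent tokens: product of per-position marginals."""
    margs = marginals(joint)
    result = {}
    for seq in product(*(m.keys() for m in margs)):
        prob = 1.0
        for t, token in enumerate(seq):
            prob *= margs[t][token]
        result[seq] = prob
    return result


if __name__ == "__main__":
    for name, dist in [("autoregressive", autoregressive_joint(TARGET)),
                       ("independent", independent_joint(TARGET))]:
        print(name)
        for seq, prob in sorted(dist.items()):
            flag = "" if seq in TARGET else "  <- not in the target"
            print(f"  {' '.join(seq)}: {prob:.2f}{flag}")
```

In this toy setting the chain-rule factorization recovers the target exactly, while the independent factorization assigns half of its probability mass to the mixed outputs "thank thanks" and "many you". This is the kind of inconsistency the abstract refers to; real models are far larger, but the failure mode has the same shape.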

fast structured decoding, non-autoregressive model, non-autoregressive sequence model

Technology:
- Information Technology > Artificial Intelligence > Natural Language > Machine Translation (0.47)